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Introduction 


The  research  presented  in  this  report  extends  earlier  work 
oy  the  L*N*K*  Corporation  CStocknan  10313*  CLambird  19803*  and 
CStockman  19793*  The  registration  of  an  image  to  a  map  or  an 
image  to  an  image*  possibly  from  different  sensors*  is  treated  in 
this  report*  The  LNK  registration  procedure  is  designed  to 
handle  these  tasks  in  a  fast  and  reliable  manner*  The  present 
form  of  the  algorithm  is  the  result  of  a  long  development  process 
in  which  both  real  edqes  and  feature  points  such  as  road 
intersect  ions  played  a  role*  Due  to  the  simplicity  and 
reliability  of  feature  points*  we  have  focused  on  the  analysis  of 
their  use  in  registrat ion* 

Registration  using  point  features  consists  of  two  facets* 
First*  curves  and  straight  lines  must  be  extracted  from  the  image 
in  order  to  locate  high  curvature  points  and  intersections* 
Second*  the  extracted  feature  points  must  be  connected  to  form 
abstract  vectors  and  the  registration  must  be  performed  with  the 
abstract  vectors* 

The  present  study  consisted  of  further  algorithm  development 
for  feature  point  detection  and  the  analysis  of  the  registration 
procedure*  given  that  a  set  of  feature  points  has  been  selected* 
The  algorithm  development  consisted  of  the  design  and 
impl ementat ion  of  a  state-space  search  procedure  for  the  merging 
of  short  curve  segments  into  a ore  reliable  curves  coupled  with  an 
intersection  detector  for  curves* 

The  analysis  of  the  registration  procedure  treated  a  variety 
of  aspects  of  the  problems  including  abstract  vector  selection 
procedures*  abstract  triples  of  ooints  rather  than  vectors* 
vector  labelling  procedures*  and  registration  accuracy*  These 
results  are  based  on  both  analytic  and  simulation  models*  while 
these  results  do  not  comprise  a  complete  picture  of  algorithm 
performance*  they  have  proven  useful  in  designing  modifications 
to  the  existing  algorithm* 

The  ultimate  usefulness  of  the  analysis  will  not  be  clearly 
established  until  the  intersection  procedures  have  been  applied 
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to-  a  suostantial  number  of  pictures  in  order  to  obtain  a  variety 
of  point  images*  The  theory  and  simulations  provide  insight  into 
the  manner  in  which  such  a  study  shoulo  be  conducted*  In 
particular  the  intersection  detection  procedure  requires  further 
testing  to  determine  the  probability  of  prominent  features  being 
detected* 

Experience  with  the  INK  procedure  has  led  us  to  the 
conclusion  that  for  a  wide  range  of  numbers  of  image  feature 
points*  the  primary  problem  in  attaining  an  accurate  registration 
is  to  guarantee  an  adequate  number  of  correctLy  corresponding 
abstract  vectors  without  incurring  a  heavy  computational  cost* 
Thus,  unless  a  large  number  of  noise  points  are  present* 
non-corresponding  points  do  not  tend  to  reduce  registration 
accuracy*  In  light  of  these  observations*  much  of  our  work  has 
been  directed  towards  procedures  for  obtaining  an  adequate  number 
of  matching  vectors  or  triples*  rather  than  the  local  accuracy  of 
the  procedure.  In  the  correlation  literature*  for  these  two 
aspects  of  registration*  the  probability  of  being  roughly  correct 
and  the  probability  of  being  very  accurate  given  that  you  are 
roughly  correct  are  treated  independently* 

The  results  of  the  registration  analysis  indicate  that  a 
combination  of  sampling  of  abstract  vectors*  usfno  triples  of 
feature  points  and  labelling  feature  points  as  to  type*  such  as  a 
straight  road  intersection  in  a  field*  can  reduce  the  number  of 
feasible  registration  t ransf ormat ions  sufficiently  to  allow  rapid 
checking  of  the  remaining  possibilities*  If  this  conclusion  is 
corroborated  by  exper imentat ion  on  real  data*  the  clustering 
portion  of  the  algorithm  can  be  discarded*  thus  eliminating  a 
costly  and  somewhat  unpredictable  portion  of  the  program* 

The  cost  of  feature  extraction  is  high*  making  it  desirable 
to  process  only  a  portion  of  the  image  if  possible*  The  analysis 
of  sampling  procedures  described  in  the  report  can  be  extended  to 
provide  design  criteria  for  a  sequential  registration  program 
which  examines  only  as  much  of  the  image  as  is  necessary  to 
determine  the  registration  transformat  ion; 


Basic  L  *N  *K  •  Registration  Technique 


This  section  presents  an  overview  of  the  basic  LNK 
registration  procedure*  The  goal  of  registration  is  to  use 
automatically  extracted  features  froa  two  images  for  an  image  and 
a  map)  to  find  the  best  global  transformation  that  maps  one  image 
to  the  other* 

The  LNK  procedure  is  able  to  achieve  this  even  when  many 
local  mismatches  of  features  occur*  The  three  basic  steps  of  the 
method  are:  1)  Primitive  features*  such  as  tine  segments*  edge 
segments*  intersections*  or  high  curvature  points  are 
automatically  extracted*  These  features  may  be  parameterized* 
such  as  by  position*  by  length*  or  by  orientation*  2)  Assume 
all  features  of  one  type  can  correspond  to  one  another*  For 
example*  an  intersection  of  three  lines  in  one  image  can 
correspond  to  any  three-line  intersection  in  the  second  image* 
However*  only  one  of  these  correspondences  is  the  true  one*  Now* 
for  each  of  the  possible  correspondences •  find  the  translation 
and  rotation  that  maos  one  feature  to  the  other*  Let  that 
translation  and  rotation  transformation  be  represented  by  the 
triple  (  9  *  x*  y) •  Place  a  unit  of  weight  in  the  bin  in  the 
three-dimensional  histogram  that  represents  <0*  x*  y>*  This 
process  is  repeated  until  all  possible  feature  correspondences 
have  been  transformed*  3)  Locate  any  prominent  clusters  of  (  9  * 
x*  y>  in  the  histogram*  Each  cluster  represents  a  set  of 
features  in  one  image  that  could  be  naooed  to  corresponding 
features  in  the  other  image  by  one  particular  (9*  x*  y>*  The 
best  global  transformation  is  defined  by  the  (9*  x*  y)  of  the 
largest  cluster*  This  transformation  provides  the  largest  number 
of  local  correspondences* 

This  method  works  because  the  correct  transformation  will 
show  up  as  a  large  cluster*  while  the  wrong  transformations  will 
tend  to  be  distributed  randomly  throughout  the  histogram*  The 
procedure  may  also  be  performed  iteratively  by  making  the  bin 
sizes  smaller  and  smaller* 

• 
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As  stated  in  Step  1  of  the  procedure*  aany  features  could  be 
used  to  perfora  the  transformation*  Not  all  features  are  eaually 
desirable*  For  example*  the  lengths  of  edge  segaents  or  line 
segaents  usually  cannot  be  accurately  determined*  Snail 
perspective  changes  can  alter  the  curvature  of  high  curvature 
points  considerably*  LNK  feels  that  intersections  *ould  produce 
acre  accurate  transforaations  as  they  can  be  deterained  no re 
accurately*  LNK  has  perforaed  enough  experiaents  to  show  that 
even  if  up  to  90  percent  of  tha  detected  features  in  the  iaage  do 
not  match  with  any  features  in  the  map*  the  correct 
transf oraation  wilt  stilt  fora  enough  of  a  cluster  to  be 
detected* 

All  of  the  versions  of  LNK 's  registration  techniaue  require 
clustering  to  be  done  in  e-space*  The  cluster  space  is  filled 
with  points*  each  one  of  which  represents  the  aatching  of  one 
iaage  eleaent  to  one  nap  eleaent  on  the  basis  of  local  features 
alone.  A  cluster  of  points  in  this  space  represents  a  globally 
good  transformat  ion  that  Batches  aany  iaage  elements  to 
corresponding  aap  elements*  A  threshold  aay  be  set  to  determine 
the  strength  of  an  acceptable  cluster;  for  instance*  we  aight 
demand  that  half  of  the  aap  eleaents  be  matched  by  image 
elements*  A  unique  cluster  will  typically  result*  but  there  may 
be  no  cluster  formed  in  the  case  where  there  is  poor  feature 
detection  or  where  the  iaage  and  aap  really  do  not  aatch*  On  the 
other  hand*  several  strong  clusters  can  result  as  in  the  case 
where  symmetry  produces  several  good  aatching  possibilities*  LNK 
has  used  two  different  clustering  techniques  -  hierarchical 
clustering  and  variable  resolution  clustering* 

In  the  hierarchical  technique*  clustering  for  *  l 8*  x«  y> 
was  done*  first  on  e  alone*  and  then  in  (  a*  y)-space  given  a 
fixed  9 •  This  technique  was  useful  for  human  interaction  since 
9  —space  could  be  viewed  as  *  histogram  and  (  x*  y)-space  as  a 
scatter  plot* 

The  6 -space  was  divided  into  360  one-degree  bins  and  the 
rotation  69  that  rotated  the  image  edge  to  the  aap  edge  was 
entered*  After  smoothing*  the  top  three  peaks  of  the  histogram 


were  chosen  for  the  second  step*  For  each  of  the  transf oraations 
that  had  a  A 9  in  one  of  the  three  peaks*  the  (  x*  y>  in  that 
transf oraation  was  recorded  in  (  a*  y)-space*  The  resulting 
scatter  plot  could  then  be  searched  for  clusters* 

Intuitively*  clustering  can  be  done  by  aoving  a  box  around 
the  cluster  space  to  see  if  there  is  soae  position  at  which  an 
acceptable  nuaber  N  of  points  are  inside  the  box*  if  so*  the 
coordinates  (  |«  x*  y>  of  that  position  aark  a  cluster  center 
and  represent  a  good  registration  transf oraation*  A  coaputer 
impleaentation  can  be  specified  when  we  choose  the  size  of  the 
box  and  the  set  of  different  positions  (bins)  at  which  we  wilt 
try  to  fill  it* 

There  is  one  final  note  on  the  clustering  procedure*  Since 
the  cluster  space  is  quantized  by  the  bins*  overlapping  sets  of 
bins  are  actually  used  so  that  clusters  are  not  lost  by  being 
split  by  bin  boundaries*  Thus  each  triple  (  0*  x*  y)  is  actually 
distributed  to  several  bins  rather  than  one* 

To  use  detected  straight  edges  instead  of  abstract  edges* 
soae  changes  need  to  be  aaoe  in  the  clustering  algoritha*  Real 
edge  detectors  create  edges  with  two  general  defects*  First* 
only  snail  segaents  of  the  true  edge  are  usually  detected 
primarily  because  only  a  fixed  size  detector  is  used*  second* 
the  detectors  may  overshoot  the  true  termination  of  an  edge  at  a 
corner • 

In  order  to  deal  with  the  above  prebleas  when  real  edges  are 
used*  the  aatching  of  iaage  vectors  to  asp  vectors  aust  be  made 
to  be  sloppy*  The  image  edge  element  aust  be  allowed  to  slide 
along  the  potentially  aatching  aodel  edge*  ffstheaat ically  this 
generates  an  infinite  set  of  triples  (e*  a*  y)  constrained  to  be 
on  a  line  segment  in  cluster  space*  In  practice  the  endpoints 
(8*  X}  *  yx  >  and  (0*  mz  »  y2  >  are  in  separate  bins*  points  are  also 
contributed  to  all  bins  in  between  them* 

The  work  in  this  report  focuses  on  the  situation  where 
abstract  vectors  rather  than  real  edges  are  used*  so  tone  of  the 
considerations  outlined  above  will  not  play  a  direct  role  in  the 
following  analysis*  In  Sections  4  and  S  we  consider  various 
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procedures  which  way  allow  us  to  dispense  with  the  clustering 
entirely* 


Curve  Intersection  Detection 


5  . 

The  detection  of  intersections  of  straight  line  segments  in 
imagery  may  not  provide  in  adequate  suoply  of  feature  points  for 
the  LNK  registration  orocedure.  Even  if  a  substantial  number  of 
such  intersections  are  found,  the  st ra iqht-l ine  intersec tions  may 
have  few  distinguishing  characteristics.  To  supplement  or 
possibly  replace  these  inter  sect  ions  ,  the  intersections  of  curves 
nay  be  used.  This  more  general  procedure  is  likely  to  be  useful 
in  imaqes  which  contain  little  or  no  man-made  imagery,  hence  feu 
straight  lines.  Curved  i nter sect i ons  may  be  labelled  by  a 
feature  vector  describing  the  curvature  of  the  intersecting 
curves.  This  feature  vector  may  then  be  used  to  greatly  reduce 
the  number  of  potential  matches  between  image  and  feature  points. 

An  algorithm  to  perform  this  curve  detection  is  given  in 
this  section.  Testing  of  this  algorithm  on  aerial  imagery  will 
be  necessary  to  evaluate  it  properly.  The  formation  of  curves 
from  local  evidence  of  edge  content  can  oe  a  costly  procedure  due 
to  the  many  ways  in  which  short  curved  segments  can  be  joined  to 
form  longer  curves.  Experimentation  on  aerial  images  should  be 
performed  to  estimate  some  bounds  on  the  computational 
requirements  for  a  range  of  pictures,  as  well  as  estimating  the 
potential  benefits. 

An  extension  of  the  curve  intersection  detector  to 
intersections  of  oaired  parallel  curves,  such-  as  roads,  is 
currently  under  consideration.  This  restricted  type  of 
intersection  detection  may  substantially  reduce  the  number  of 
rotation  and  translation  t ransf ormat ions  which  must  be  considered 
in  the  latter  part  of  the  LNK  procedure. 
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3.1  Continuous  Curve  extraction 


The  extraction  of  continuous  curves  consists  of  two 
operations;  the  detection  of  edge  points  and  the  Linking  of  edge 
points  into  continuous  curves.  The  details  of  this  procedure  are 
given  in  CStockman  19311.  We  review  this  material  briefly  to 
provide  insight  into  the  input  of  the  curve  intersection  program 
described  in  Section  3.2. 

A  point  can  be  discovered  to  be  an  edge  point  by  testing  the 
tonal  values  in  halfplanes  on  either  side  of  the  point.  Figure 
3.1  shows  the  32  directions  tor  edges  that  were  used  in  the 
reported  research.  A  general  purpose  routine  (RPSNL)  exists 
which  can  differentiate  either  g»  1$,  or  32  directions  around  the 
circle.  Three  "Basks"  for  computing  directional  gradient  values 
are  shown  in  Figure  3.2.  Direction  d  *  1  is  the  vertical 
direction  with  higher  tones  at  the  right  while  direction  d  *  7  is 
nearly  horizontal  with  higher  tones  below.  Given  a  pixel  (x,y> 
in  the  image,  the  gradient  magnitude  in  each  direction  d  *  1»2». 
.  .,52  can  be  computed  by  adding  and  subtracting  tonal  values  as 
indicated  by  the  masks  in  Figure  3.2.  Division  by  a 
normalization  factor  is  performed  to  take  into  cons iderat ion  the 
number  of  pixels  used  and  to  make  the  magnitude  geomet r i cal ly 
isotropic.  The  magnitude  and  direction  of  the  gradient  at  (x,y) 
is  taken  to  be  the  magnitude  and  direction  where  a  maximum  is 
achieved.  Note  that  for  d  *  17  the  mask  for  d  *  1  could  be  used 
with  reversal  of  the  sign  on  the  magnitude,  so  that  only  16  masks 
are  actually  applied.  Only  4  directions  are  tried  at  resolution 
8.  The  points  selected  by  the  masks  sometimes  differ  from  the 
ideal  due  to  implementation  cons  id«rat ions . 

The  oresence  of  one  edge  may  interfere  with  tj»e  detect*on  of 
neighboring  edges.  Suppression  of  known  edge  points  m*y  be 
required  in  this  situation. 

By  examining  the  neighbors  of  edge  pixels*  ft  is  sometimes 
possible  to  determne  the  two  pixels  which  continue  the  edge  in 
the  forward  and  backward  direction.  (Forward  edge  traversal  by 
definition  keeps  higher  tonal  values  toward  the  right.) 
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Figure  3.1  32>dlr*etlonal  codas  and  associated  angular  heading 
In  dsgrsss. 
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Figure  3.  2  Three  masks  used  for  compuclng  Che  gradient  at 

a  point  in  directions  1,4,  and  7.  The  directional 
resolution  is  one  32nd  of  the  circle;  N  is  the 
normalization  factor.  Edge  directions  of  1,4, 
and  7  are  indicated  by  the  arrow. 
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Continuation  can  be  determined  from  the  spatial  orientation  of 
the  neighbors  and  the  gradient  direction  of  the  neighbors  all 
relative  to  that  of  the  original  pixel.  When  deternin ing  which 
neighbor  is  the  best  forward  and  backward  continuation.  a 
variable-sized  neighborhood  is  scanned  in  a  soiral  pattern.  See 
CStocfcisan.  19793  for  a  detailed  discussion  of  continuous  edge 
l ink ing. 
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5.2  Curve  Intersection  Detection  Algorithm 


This  section  describes  an  algorithm  for  the  detection  of 
curves  and  the  suoseguent  detection  of  intersections*  This 
algorithm  has  been  implemented  and  undergone  limited  testing. 
Further  testing  and  algorithm  refinement  are  now  being 
cons idered. 

The  first  phase  of  curve  detection  is  the  computation  of  the 
gradient  and  linking  as  described  in  Section  3.1. 

The  neat  stage  in  the  intersection  detection  process  is  to 
perform  a  piecewise  linear  approximat ion  to  the  curve*  A 
sequential  procedure  for  extracting  the  linear  approximation  is 
performed*  The  algorithm  Sequentially  examines  the  stra ight-t ine 
segments  joining  one  end  of  the  curve  to  its  neighbor*  joining 
the  end  to  the  third  point  on  the  curve*  etc.  For  each  line 
segment*  the  sum  of  sguared  error  between  the  curve  and  the 
segment  are  computed.  As  soon  as  the  error  exceeds  a  threshold* 
the  line  segment  joining  the  previous  point  to  the  endpoint  of 
the  curve  is  selected. 

The  error  associated  to  each  ooint  on  the  curve  is  the 
vertical  or  horizontal  distance  from  the  point  to  the  straight 
line  segment.  For  purposes  of  numerical  stability*  the 
horizontal  distance  is  chosen  it  the  absolute  value  of  the  slope 
is  greater  than  one  and  the  vertical  distance  is  chosen 
otherwise*  The  suns  of  squared  error  is  the  sun  of  the  sguared 
distance  for  each  point  on  the  curve  to  the  corresponding  point 
on  the  line  segment* 

Having  found  one  line  appromiaat ing  Dart  of  the  curve*  the 

i 

algorithm  repeats  the  process  usinq  the  endpoint  of  the 
just-fitted  line  segment  interior  to  the  curve  as  the  new 
starting  endpoint*  The  procedure  repeats  until  the  whole  curve 
is  fitted*  Though  this  procedure  guarantees  no  global  ootimum 
least  squares  error  fit  to  the  entire  curve*  it  has  performed 
well  in  practice  and  is  much  easier  than  global  optimization 
procedures • 

The  next  stage  in  the  intersection  detection  process  is  the 
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merging  of  curves.  A  hierarchical  merging  procedure  is  now 
invoked  (or  finding  the  longest  oossible  curves  that  can  be 
formed  from  shorter  curves  which  fit  well  together.  This  merging 
procedure  emoloys  an  optimization  procedure  known  as  st ate-soace 
search  to  find  good  merges.  Initially*  a  list  of  oairs  of  curves 
which  are  candidates  for  merges  is  set  uo.  Each  oair  of  curves 
in  the  image  has  associated  with  it  a  set  of  merits  indicating 
the  quality  of  the  potential  mergino.  At  present*  four  measures 
of  merge  merit  are  calculated  and  combined  to  form  a  sinqle  merit 
figure.  Only  curve  pairs  with  a  merit  figure  less  than  a 
threshold  are  placed  en  the  list  of  merge  pairs*  called  the  open 
list  for  merges.  Any  pairs  of  curves  which  have  an  endooint  of 
the  piecewise  tinear  aporoximat ion  of  one  curve  close  to  an 
endooint  of  the  piecewise  linear  approximation  of  the  other 
curve*  but  which  fails  to  satisfy  the  merge  criterion*  is  placed 
on  a  list.  This  list*  called  the  intersecton  open  list*  contains 
curves  which  are  likely  candidates  for  intersect  ions. 

Processing  proceeds  by  removing  from  the  merge  open  list  the 
curve  pair  with  the  highest  merit.  Let  A  and  B  denote  the  curves 
and  let  A  have  endpoints  and  *i  and  let  B  have  endpoints  b^ 
and  b2  and  assume  aj  and  bj  are  the  close  endpoints.  Then  a  new 
symbol*  C,  denoting  the  merging  of  those  curves*  is  created.  Any 
curve  0*  such  that  <A*D>  is  o'*  the  ooen  list  and  a  is  the 
endpoint  near  0  resutts  in  the  pair  (C,&>  being  put  on  the  merge 
open  list.  Similarly,  any  curve  E  such  that  CD,E>  is  on  the  open 
list  and  b2  is  the  vertex  near  E,  results  in  the  pair  CC,E)  being 
put  on  the  merge  open  list. 

The  merging  of  curves  may  be  intuitively  thought  of  as  a 
soldering  of  the  wires  which  are  close  together.  The  above 
condition  for  forming  new  pairs  can  be  interpreted  as  saying  we 
first  solder  two  wires  together  and  then  allow  this  soldered  wire 
to  form  a  pair  with  any  wire  near  its  free  ends  but  not  with 
wires  near  the  soldered  joint.  For  each  new  pair  placed  on  the 
open  list*  a  set  of  merge  merits  is  comouted. 

A  pair  of  curves  which  are  meraed  may  result  in  a  new  curve 
which  has  no  other  curves  with  which  it  may  be  merged.  This  can 
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happen  in  two  ways.  First  the  new  curve  nay  not  have  either 
endpoint  near  the  endpoint  of  any  other  curve.  In  this  case  the 
curve  is  removed  from  the  ooen  list  and  plays  no  further  role  in 
the  merge  process.  Second,  the  curve  may  be  near  an  endpoint  of 
another  curve,  but  the  new  curve  oair  may  not  satisfy  the  merge 
criterion.  if  this  occurs,  the  pair  is  put  on  the  intersection 
open  list. 

The  process  of  removing  the  best  pair  from  the  open  list  and 
generating  merged  curves  and  possibly  new  pairs  tor  the  merge 
list  or  the  intersection  list  is  repeated  until  the  merge  open 
list  is  empty.  At  this  point  the  intersection  open  list  contains 
the  list  of  curve  pairs  which  represent  possible  intersections. 

The  merit  function  for  merging  pairs  is  based  on  four 
measures  of  merit,  namely  distance,  alignment,  gooa  continuation, 
and  strength.  Each  merit  measure  assumes  values  between  zero  and 
one.  The  four  merit  measures  are  added  and  the  sum  divided  by 
four  to  yield  a  merit  function  taking  values  from  zero  to  one, 
where  lower  values  indicate  better  merges. 

Distance,  the  first  of  the  merit  measures,  is  defined  to  be 
the  distance  between  the  nearby  endpoints  of  the  curves  divided 

by  the  maximum  length  of  the  curves.  By  the  distance  between 

nearoy  endpoints,  we  refer  to  the  quantity 
d  *  m  i  n  ( a ( at  , b2  )  ,  d ( a2  , bx  ) ) 

where  one  curve  has  endpoints  la^  ,a2),  the  second  curve  has 
endpoints  (bj,b2),  and  d  denotes  Euclidean  dist*nce.  An 
alternate  distance  measure  may  be  defined  by  using  the  average  of 
the  curve  lengths  rather  than  the  maximum  but  this  has  not  yet 
been  explored. 

Alignment  refers  to  the  di fference  in  direction  between  the 
two  close  ends  of  the  curves  which  may  be  merged.  The  line 
segments  on  the  close  ends  of  the  curves  are  used  for  this 

measure.  Those  line  segments  have  directions  Induced  by  the 

direction  of  the  underlying  curves.  Directions  from  zero  to  ?6p 

are  quantized  in  32  levels  and  assigned  a  value  as  in  Figure  3.1. 

The  continuation  merit  of  a  curve  merge  is  defined  to  be  the 

average  of  the  angles  formed  by  the  end  line  segments  of  the 


curves  with  the  tine  joining  their  nearby  end  points*  If  the 
curves  align  property*  each  angle  is  zero*  Finally*  the  strength 
of  a  curve  is  a  measure  of  the  intensity  of  the  image  on  the 
curve*  For  a  line  segment  in  the  piecewise  linear  approximation 
to  a  curve*  we  define  the  strength  of  the  line  segment  to  be  the 
average  grey  level  for  points  on  the  curve  between  the  endpoints 
of  the  line  segment*  The  strength  of  a  curve  is  thus  defined  to 
be  the  average  strength  merit  of  its  line  segments*  We  then 
define  the  strength  of  a  curve  pair  to  be  one  minus  the  maximum 
of  the  strength  of  the  two  curves  divided  by  the  maximum  grey 
level  in  the  picture* 
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4*  Registration  Analysis 

The  analysis  of  the  LNK  registration  procedure  involves  a 
study  of  various  saaplin$  questions  and  in  one  approximation  to 
the  problem*  an  analysis  of  discrete  iaage  correlation*  The 
results  of  this  section  represent  our  efforts  to  treat  those 
aspects  of  the  problea  ue  consider  aost  relevant  to  registration 
accuracy*  Ue  were  unable  to  find  results  in  the  literature 
oroviding  accuracy  of  siailar  point  notching  prooleas  though 
related  point  aatching  techniques  have  apoaared  CFischler  and 
3ol les  19813* 

Through  auch  of  Section  4»  *e  focus  on  the  study  of 
screening  procedures  for  eliainating  abstract  vectors  uhlch  do 
not  contribu* %o  potential  registration  transforaations*  This 
allows  processing  of  a  substantial  oercentage  of  the  set  of  all 
possible  iaage  vectors*  thus  Insuring  that  aany  correct  aatching 
vectors  uill  be  given  to  tire  registration  procedure* 
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<♦•1  °oint  Image  Modelling 


Tne  modelling  of  point  feature  images  is  useful  in  the 
understanding  of  the  Ink  registration  procedure*  At  this  time*  a 
primary  practical  value  of  this  modelling  is  in  the  design  of 
simulation  studies  for  the  algorithm,  although  some  analytical 
results  are  presented*  Though  the  successes  of  image  modelling 
have  often  been  limited  by  the  inadequacies  of  the  models  in 
representing  rral-world  problems,  ve  think  the  simplicity  of  the 
point  image  input  to  the  abstract  vector  registration  procedure 
permits  the  development  of  applicable  models* 

Before  describing  any  formal  image  models,  we  discuss  those 
features  of  the  real-world  problem  which  we  think  are  likely  to 
affect  the  nerformance  of  the  registration  procedure*  The  models 
will  deal  only  with  images  consisting  of  a  small  number  of 
points,  typically  less  than  100*  of  uniform  intensity  on  a 
background  of  a  uniform  but  different  intensity.  Due  to  the 
dependence  of  the  number  of  points  on  both  image  content  and  the 
feature  extraction  procedure,  this  number  is  difficult  to 
estimate*  We  expect,  in  many  cases,  the  image  to  contain  far 
fewer  than  10(7  points*  The  map  may  contain  only  a  sample  of  the 
feature  points  occurring  on  a  real  map,  perhaps  as  few  as  five  or 
ten*  For  simplicity,  we  normalize  the  points  to  have  intensity 
one  and  the  background  to  have  intensity  zero* 

No  attempt  has  yet  been  made  to  analyze  the  probability  of 
feature  points  occurring  given  a  description  of  the  original  grey 
scale  image*  Such  an  analysis  would  treat  guestions  such  as  the 
probability  of  a  road  intersection  detector  failing  to  detect  *n 
intersection  if  one  were  present,  and  of  falsely  detecting  an 
intersection  when  none  was  present*  This  type  of  analysis  will 
not  be  feasible*  even  on  a  purely  empirical  basis,  until  we  have 
had  more  opportunity  to  study  feature  selection  procedures*  A 
second  type  of  empirical  study  of  use  in  modelling  would  be  the 
estimation  of  the  distribution  of  feature  points  on  m*ps  and 
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real-world  iaagery.  Here,  we  would  be  concerned  with  huaan 
selection  of  features  that  we  would  like  our  feature  point 
detectors  to  find*  Though  this  type  of  study  would  involve 
coaplex  saapling  probleas*  it  would  aid  greatly  in  refining  our 
registration  aodels* 

Our  use  of  the  tera  "point  iaage  aodelling"  has  been  loose* 
and  we  now  wish  to  clarify  natters*  Two  types  of  aatching 
probleas  are  considered  In  this  report:  laage-to-laage  aatching 
and  laage-to-aap  aatching*  The  analytic  foraulation  of  the  two 
probleas  is  siailar,  but  in  the  case  of  iaage-to-aap  aatching*  we 
shall  often  be  able  to  aak*  siaplifying  assuaptions  due  to  our 
confidence  that  nap  features  are  reliable*  As  a  result*  we  shall 
use  the  tera  "point  iaage"  to  refer  to  an  iaage  foraed  froa 
either  a  real  iaage  or  a  nap* 

The  essential  Idea  we  wish  to  capture  in  our  point  iaage 
aodelling  relates  to  the  types  of  iaage  noise*  cons*dtr  first 
the  case  of  aatching  an  iaage  to  a  nap*  We  assuae  that  certain 
points  have  been  selected  froa  the  aap  to  serve  as  feature 
points*  The  iaages  will  contain*  in  general*  soae  points 
corresponding  to  the  aap  points  and  soae  additional  points*  At 
this  point*  several  difficulties  are  present  in  the  problea* 
First*  not  all  iaage  points  correspond  to  aap  points*  and  vice 
versa  (by  correspond  we  aeanv  "correspond  to  the  saae  ground 
features")*  Second*  corresponding  iaage  and  aap  points  aay  not 
have  the  saae  positions  due  to  distortions  induced  by  elevation 
and  feature  extraction  inaccuracies*  A  generating  aechanisa  for 
creating  such  iaages  Is  to  create  a  set  of  aap  points  which  have 
corresponding  iaage  points*  Next  create  a  copy  of  these  points 
in  the  iaage  but  perturb  the  location  of  each  one*  Finally* 
generate  points  randoaly  in  the  aap  and  another  set  of  points 
randoaly  in  the  iaage  repnesenting  the  non-corresponding  points* 

The  procedure  dotcribad  above  for  generating  iaage-aap  pairs 
can  be  applied  to  generating  iaafle  pairs*  Soae  significant 
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differences  are  likely  to  exist  In  the  final  models*  for  by 
tuning  feature  extraction  procedures*  we  may  be  confident  that 
■ost  aap  points  will  occur  In  the  Image*  However*  there  may  be 
■any  laage  points  not  occurring  In  the  aap*  Thus  the  laage  aap 
problea  aay  be  simplified  by  assualng  no  noisy  points  exist  In 
the  aap*  This  saae  assumption  also  allows  us  to  select  a  saall 
number  of  aap  ponts  in  comparison  with  the  nuaber  of  laage  points 
(see  Section  4*7)* 

The  model*  as  stated  above*  requires  the  specification  of 
probability  distributions  for  generating  corresponding  points* 
perturbations  of  corresponding  points*  and  background  noise 
points*  A  siaple  aatheaatical  model  arises  If  we  generate 
corresponding  points  and  background  noise  points  uniformly  and 
perturb  corresponding  points  using  bivariate  Gaussian  noise* 
Since  this  aodel  is  the  basis  of  aany  of  our  siaulation  studies* 
the  plausibility  of  these  assumptions  will  now  be  discussed* 

The  assumption  that  corresponding  points  are  generated  by  a 
bivariate  uniform  distribution  is*  in  aany  cases*  clearly 
inappropriate.  Urban  areas  have  aany  feature  points  such  as  road 
intersections  which  are  reglarly  located*  For  such  area*  the 
assuaption  of  uniforaly  distributed  corresponding  points  aay  lead 
to  overly  optiaistic  performance  predictions*  For  more  rural 
areas  and  areas  In  which  natural  rather  than  aan-aade  point 
features  are  selected*  the  assuaption  of  uniformity  aay  be  aore 
tenable* 

The  assuaption  of  uniformity  for  the  distribution  of 
corresponding  points  aay  be  dropped  if  background  points  »re 
uniforaly  generated  and  we  assuae  we  are  aatching  an  laage  to  a 
aap  and  the  aap  has  a  slrarply  peaked  autocorrelation  function. 
Under  these  assuaptions  the  autocorrelation  of  the  aap  to  Itself 
will  not  contribute  significantly  to  registration  error.  Thus 
registration  error  will  result  primarily  froa  high  peaks  in  the 
cross~correlat1on  function  of  the  aap  with  the  background  noise. 


Since  the  map  and  the  noise  are  uncorrelated »  we  expect  high 
peaks  to  be  raref  though  this  topic  requires  further 
invest igat ion* 

The  above  observations  have  an  interesting  potential 
application*  By  applying  the  registration  procedure  to  two 
copies  of  the  map  and  coopering  the  oatch  weight  at  the  tioe  of 
t ransf oroat ion  with  the  oatch  weight  at  other  peaks  selected  by 
the  registration  procedure,  we  have  a  method  of  assessing  the 
extent  to  which  the  structure  of  the  map  is  likely  to  lead  to 
confusion*  If  it  were  then  possible  to  establish  the  expected 
matching  between  background  noise  points  and  the  map  points*  then 
criteria  could  be  set  for  determining  whether  or  not  a  successful 
match  has  been  made*  Such  criteria  would  be  extremely  useful  in 
iterative  feature  extraction  and  registration  procedures  (see 
Section  6)  in  which  the  decision  as  to  whether  or  not  to  do 
further  feature  extraction  is  based  on  the  success  of  the 
registration* 

The  assumption  that  image  or  noise  points  are  uniformly 
generated  nay  be  replacmd  by  the  assumption  of  a  Poisson 
distribution  for  the  points*  Observing  that  the  uniform 
distribution  of  a  fixed  number  of  points  is  the  same  as  a  poisson 
distribution  conditioned  on  the  number  of  points*  the  difference 
between  these  two  models  is  not  great*  we  have  chosen  the  former 
model  since  the  number  of  image  and  nap  points  will  be  known  at 
the  tine  of  registration*  thus  allowing  for  more  accurate 
performance  predictons.  With  a  Poisson  model*  the  performance 
predicton  is  averaged  over  various  numbers  of  points  occurring* 
A  further  limitation  of  the  Poisson  model  is  that  it  does  not 
allow  us  to  control  the  mean  and  variance  independently*  Thus* 
for  Instance,  as  we  increase  the  expected  number  of  map  points* 
we  increase  the  variance  of  the  number  of  nap  points  forcing  us 
to  average  performance  prediction  over  a  wide  range  of  numbers  of 
map  points* 

The  Poisson  image  modiel  is  a  limiting  case  of  a  binomial 
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inage  model  in  which  an  image  is  forned  by  generating  a  zero  or 
one  at  each  pixel  according  to  a  binomial  distribution.  This 
type  of  model  is  similar  to  the  Gaussian  model  used  for  gray 
scale  images  in  which  each  pixel  is  assumed  to  have  an  intensity 
generated  from  a  Gaussian  distribution. 

Just  as  the  binomial  image  model  nay  be  viewed  as  a  discrete 
version  of  the  Poisson  process  image  model*  a  set  of  points 
uniformly  distributed  in  a  finite  array  of  pixels  may  be  viewed 
as  the  discrete  analog  of  bhe  continuous  image  model  with  a  fixed 
number  of  points  uniformly  distributed. 

Alt  simulations  and  analyses  in  this  report  are  based  upon 
the  assumption  that  the  number  of  points  is  fixed  and  that  the 
points  have  coordinates  generated  from  a  uniform  random  number 
generator.  In  experimental  situations*  the  number  of  map  points 
is  small*  generally  ranging  from  five  to  ten  while  the  number  of 
image  points  ranged  from  zero  to  fifty*  depending  upon  the 
experiment.  These  valu<s  were  selected  for  several  reasons. 
First*  as  stated  above*  the*  map  used  for  matching  can  be  sparsely 
sampled  from  a  true  match*  so  we  are  free  to  choose  as  few  points 
as  possible  as  long  as  we  can  get  a  correct  transformation. 
Second*  by  placing  more  of  the  burden  on  the  feature  extractors* 
we  think  the  number  of  points  can  be  reduced  to  a  very  low  level. 
Third*  the  cost  of  exper iment ing  with  larger  parameter  values  did 
not  seem  justified  until  further  results  on  the  feature 
extraction  are  available. 
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4*2  The  LNK  Registration  Procedure  and  Correlation 

The  LNK  registration  procedure  may  in  some  cases  be  viewed 
as  a  rapid  procedure  for  computing  an  approximation  to  the  cross 
correlation  of  two  binary  .images*  In  this  section  we  describe 
this  relationship  in  detail  and  discuss  its  significance*  Owing 
to  the  paucity  of  useful  results  on  the  accuracy  analysis  of 
correlation  methods  for  registration*  this  relationship  serves 
more  to  highlight  the  difficulties  of  analysis  rather  than 
yielding  a  transfer  of  useful  theory*  Oespite  this  caveat*  this 
relationship  is  important  .in  the  comparative  evaluation  of  the 
LNK  procedure  and  other  registration  methods* 

The  view  of  the  LNK  procedure  as  a  correlation  method  may  be 
clearly  seen  in  the  following  situation.  Assume  we  are  given  two 
binary  images*  Ii  and  12*  for  which  seek  a  "good**  translation 
mapping  II  into  12,  Uitirout  further  information  concerning  the 
images*  we  can  have  no  notion  of  a  correct  t ransf oraat ion; 
however*  a  meaningful  comparison  of  the  results  of  the  LNK 
procedure  with  correlation  methods  may  be  made  if  we  are  only 
interested  in  the  similarity  of  their  conclusions* 

In  the  restricted  problem  under  eonf  i  derat  i&r- ,  the  LNK 
procedure  need  only  examine  translations  rather  than  general 
rigid  motion**  No  translation  will  be  considered  unless  it  maps 
at  least  one  abstract  vectqr  in  11  approximately  into  an  abstract 
vector  in  12*  If  the  match  were  required  to  be  exact*  then  the 
cross-corre lation  function  of  the  two  images  at  the  lag  given  by 
this  translation  would  have  a  value  of  at  least  two*  since  the 
Image  values  are  one  or  zerfo*  If  we  require  the  exact  matching 
of  vectors  to  determine  the  number  of  matches  for  a  given 
transformation,  then  the  connection  between  the  two  procedures 
nay  be  seen  immediately* 

The  LNK  procedure  computes  the  number  of  matching  vectors  in 
the  two  images  and  thus  implicitly  computes  the  number  of 
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matching  points.  To  calculate  the  number  of  matching  points, 
merely  total  the  number  of  distinct  points  in  II  which  are 
endpoints  of  matched  vectors*  Conversely,  given  the  set  of 
matching  points,  we  can  determine  the  number  of  matching  vectors* 
Since  the  cross-correlation  of  two  binary  images  represents  the 
number  of  matching  points  at  a  given  lag,  the  only  difference 
between  the  procedures  is  that  one  counts  matching  points  and  one 
counts  matching  vectors*  Sf  vectors  are  chosen  so  that  no  point 
is  an  endpoint  of  more  than  one  vector,  then  the  two  procedures 
yield  identical  results  far  this  translation  mapping  of  two 
binary  images* 

The  assumptions  made  in  comparing  the  outcomes  of  the  two 
procedures  bear  closer  scrutiny*  Clearly,  in  remote  sensing 
applications  some  tolerance  must  be  allowed  in  the  matching* 
Although  the  definitions  of  the  tolerances  in  the  LNK  procedure 
are  given  in  CStockman  19803,  they  are  somewhat  complex  and 
evolved  from  the  need  ta  match  real  vectors  (edges)  as  well  as 
abstract  vectors •  Nevertheless,  the  match  weight  computed  for  a 
pair  of  vectors  is  closely  related  to  the  matching  of  their 
endpoints*  The  total  match  weight  for  a  given  transformation  may 
be  viewed,  in  the  case  where  no  two  vectors  have  a  common 
endpoint,  as  the  sums  of  the  merits  of  point  matches  between  the 
two  images* 

Cross-correlation  is  bikely  to  perform  poorly  if  points  are 
sparsely  distributed*  A  variety  of  methods  may  be  employed  to 
circumvent  this  difficulty*  First,  the  image  nay  be  redefined 
using  a  coarser  grid  and  assigning  the  value  one  to  any  block 
which  contains  a  pixel  in  the  original  image  with  value  one* 
Second,  the  original  image  may  be  redefined  by  assigning  the 
value  one  to  all  pixels  within  a  fixed  distance  of  a  pixel  with 
value  one  in  the  original  image*  By  varying  the  block  size  or 
fixed  distance,  the  tolerances  can  be  adjusted* 

The  effect  of  these  two  image  modification  procedures  on  the 
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correlation  function  is  difficult  to  analyse  exactly.  Under  the 
f*rst  aethod  only  the  location  of  the  block  containing  a  point  is 
relevant*  nQt  its  location  within  the  block.  Thus  if  each  block 
is  d  units  on  a  side*  a  point  P  in  II  nay  map  into  a  location  in 
12  which  is  as  great  as  2d  units  fro*  a  point  8  in  12  and 
conribute  as  much  to  the  correlation  function  as  if  P  mapped 
exactly  into  0.  Thus  this  method  may  be  viewed  as  a  type  of 
thresholded  correlation  based  roughly  on  distance.  The  term 
threshold  applies  since  two  points  conribute  one  to  the 
correlation  function  if  they  are  close  and  zero  otherwise. 

The  second  ima9*  modification  procedure  assigns  to  each  pair 
of  approximately  c0p responding  points  a  measure  of  their 
closeness*  namely  the  area  of  intersection  of  two  discs*  one 
surrounding  each  point.  This  measure  is  approximately  monotonic 
as  a  function  of  the  distance  between  points*  but  this  may  vary 
slightly  with  the  distance  measure  chosen.  In  summary*  the  first 
method  provides  a  counting  measure  of  matches  while  the  second 
gives  a  sum  of  merits  from  the  matches. 

Both  the  LNK  procedure  and  cross~correlat ion  using 
tolerances  compute«  for  each  translation  for  which  there  are  at 
least  two  matches*  the  number  of  matches  and  the  sum  of  the 
merits  of  the  matches.  For  all  other  translations*  the  LNK 
procedure  implicitly  assigns  a  value  of  zero. 

The  analogy  between  the  LNK  procedure  and  correlation  of 
binary  images  is  used  in  Section  4.3  to  compute  characteristics 
of  the  LNK  procedure  using  correlation  analysis  methods.  We 
think  this  type  of  analysis  may  be  extended  considerably  using 
further  results  fro*  the  correlation  literature.  the  labelling 
of  points  may  also  be  included  in  a  correlation  model*  but  we 
have  not  yet  developed  the  point  of  view  adequately  to  allow 
analysis  of  registration  error. 
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4,3  Registration  Accuracy 


As  discussed  in  Section  4.2,  in  the  case  of  translation 
matching  of  binary  images,  the  L.N.K.  registration  algorithm 
provides  results  similar  to  those  obtained  using  the 
cros s-corre lat ion  function.  To  gain  a  deeper  understanding  of 
the  procedure,  a  theoretical  analysis  is  desirable*  The 
probabilistic  analysis  of  the  LNK  procedure  requires  the 
evaluation  of  certain  highly  complex  combinatorial  expressions, 
which  we  are  not  currently  able  to  evaluate*  in  consequence,  „e 
have  chosen  to  view  the  LNK  technique  as  a  rapid  procedure  for 
computing  the  cross-correlation  Qf  the  binary  pictures  or  of  a 
picture  and  a  map,  at  least  for  the  present  analysis* 

Although  the  remote  sensing  literature  contains  many  pacers 
analyzing  correlation  as  a  registration  technique  EMostafavi 
1979],  the  results  have  not  been  wholly  satisfactory*  A  crucial 
part  of  these  analyses,  of  course,  is  the  underlying  model  of  the 
image.  In  most  works  images  are  modelled  as  an  array  of  Gaussian 
processes*  The  measures  of  correlation  performance  which  a^e 
studied  using  this  type  of  model  have  some  desirable  properties, 
and  they  leave  many  important  questions  unanswered* 

The  basic  registration  model  assumes  the  presence  of  two 
images,  a  reference  image  and  a  sensed  image: 

reference  image  =  Ir(x)  *  P(x)  xe* 
sensor  image  =  Is(x)  *  ®d(x>  ♦  N(x) 

Here  M  denotes  the  set  of  points  in  the  image  plane  and  x  *  (xj  , 
x2  )  represents  a  point  in  this  plane.  P(x),  Pd(x>,  and  N(x) 
represent  random  variables  where  Plx)  gives  the  value  of  the 
reference  image  at  point  x,  Pd(x)  is  the  contribution  to  the 
pixel  value  in  the  sensed  image  due  to  the  fact  that  p(x)  has 
been  distorted  by  a  translation  and  n(x)  represents  the 
contribution  to  the  sensed  image  due  to  noise. 
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The  image  mocel  described  acove  assumes  each  image  is  one 
element  of  a  set  of  possible  images  forming  the  ensemble  of  a 
stochastic  process*  We  assume  that  Pd(x)  is  given  by 

Pd(x)  *  P (x  +  t ) 

where  t  is  a  vector  representing  a  translation* 

The  basic  Quantity  of  interest  in  this  analysis  is  the 
cross-correlation  function: 


C(t)=Els  <x-t) Irlx) 


where  the  sum  is  taken  over  the  picture  which  has  w  pixels*  Note 
that  for  fixed  t,  C<t)  is  a  random  variable  since  Is  and  Ir  are 
random  processes* 

One  goal  of  registration  analysis  is  to  compute  the 
probability  of  false  registration*  This  is  easy  to  define  in  the 
case  of  most  correlation  procedures  since  in  these  situations  a 
small  area  in  a  reference  image  is  being  correlated  with  windows 
in  the  sensed  image.  A  false  registration  is  a  peak  in  the 
correlation  function  indicating  a  match  between  a  segment  of  the 
reference  image  and  a  noncorresponding  window  in  the  sensed 
image*  Because*  in  the  discrete  case*  we  are  working  with  both 
images  in  their  entirety*  this  definition  must  be  modified. 
Clearly  a  registration  in  which  reference  image  points  are 
overlaid  only  on  noise  points  from  the  sensed  image  should  be 
regarded  as  a  false  registration*  But  how  should  we  regard  a 
registration  in  which  several  points  in  the  reference  image 
overlay  reference  points  in  the  sensed  image  but  not  correct 
points*  As  a  trivial  example*  if  the  reference  image  consists  of 
four  points  lying  at  the  corners  of  a  square*  then  there  are  four 
ways  in  which  we  get  a  perfect  overlay*  but  only  one  is  correct. 


up  estimate  some  bounds  on  this  type  of  error  using  simulation. 
In  practice,  any  gross  registration  error  will  arise  primarily 
from  the  map  matching  with  noise.  Essentially*  this  is  a 
restatement  of  the  assertion  that  for  very  soarse  point  images 
the  autocorrel at  ion  tends  to  be  very  sharply  peaked  and  have  no 
large  secondary  peaks* 

For  purposes  of  analysis  we  assume  the  autocorrel at  ion  of 
the  reference  image  takes  on  only  three  values:  n»  one,  and  aero, 
where  n  is  the  number  of  points  in  the  reference  image.  We 
justify  tMs  assumption  by  observing  that  in  1,000  sets  of 
reference  image  each  containing  five  uniformly  distributed 
points,  this  assertion  was  justified  over  500  times.  To 
appreciate  the  significance  of  this  50  percent  figure,  we  must 
consider  how  the  reference  image  or  map  is  formed. 

The  image  or  map  from  which  the  reference  image  is  formed 
willt  in  general,  contain  far  more  points  than  the  reference 
image.  As  an  example,  suppose  we  have  a  map  containing  forty 
points,  any  of  which  may  be  selected  for  the  reference  image*  If 
we  wish  to  select  five  points,  there  are  458,008  such  subsets* 
If  all  these  subsets  were  independent,  then,  since  each  subset 
selected  would  have  a  50  percent  chance  of  satisfying  our 
autocorr elat ion  property,  the  probability  of  not  finding  a 
suitable  subset  in  seven  independent  selections  of  five  points  is 
less  than  1  percent.  Though  the  subsets  are  not  independent, 
seven  disjoint  sets  of  five  points  could  be  selected  from  the 
for ty  points* 


The  above  arguments  could  be  developed  formally  to  get 
better  estimates  for  the  probability  of  selecting  a  good 
reference  image,  but  we  think  that  the  assumption  of  the  peaking 
of  the  autocorrelatl ve  function  of  the  reference  image  is  clearly 
reasonable*  In  practice,  the  selection  of  reference  images  with 
sharply  peaked  autocorrelation  functions  is  a  simple  matter* 
This  image  is  then  a  map  with  a  single  large  peak  in  the 
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autocorrelation  function.  we  need  only  select  a  small  subset  of 
map  points  which  has  the  property  that  no  two  interpoint 
distances  are  close*  This  image  is  then  a  mao  with  a  single 
large  peak  in  the  autocorrelation  function. 

Recall  that  this  digression  on  the  autocorre lat ion  function 
of  the  reference  image  was  introduced  to  clarify  the  notion  of  a 
false  registration.  In  summary(  we  have  concluded  that  the  total 
conrt r ibut ion  to  the  cross-correlation  function  far  from  the  true 
transformation  due  to  the  autocorr elat ion  of  its  reference  image 
is  no  greater  than  one.  Thus  we  are  justified  in  assuming  that  a 
false  registration  arises  almost  entirely  from  the  interaction  of 
the  points  in  the  reference  image  with  the  noise  points  in  the 
sensed  image. 

At  the  true  transformation  the  reference  points  should  align 
with  the  corresponding  image  points  so  the  number  of  aligned 
points  should  be  n,  the  number  of  reference  image  points.  We 
define  a  false  registration  to  be  a  peak  in  the  correlation 
function,  greater  than  the  value  of  the  peak  at  the  true 
transformationy  located  more  than  a  few  degrees  and  a  few  pixels 
from  the  oeak  corresponding  to  the  correct  transformation  such 
that  more  than  n  points  align  at  this  incorrect  peak.  As 
mentioned  before,  the  exact  number  of  pixels  and  degrees  remains 
to  be  determined,  but  five  degrees  and  ten  pixels  appears 
reasonable  based  on  experimental  data. 

We  now  turn  to  the  study  of  C(t)  where  it  is  far  from  the 
current  transformation.  For  such  values  of  t»  we  can  write  our 
correlation  function  in  the  following  form: 

C  <  t ) *CN ( t ) *  Z  Nlx-t)P(x) 

For  a  fixed  reference  image  P(x)  we  can  estimate  the  expected 
value  0f  C  ( t ) »  where  the  expectation  is  taken  over  an  ensemble  of 
sensed  images.  In  this  ensemble  only  the  noisy  points  vary  since 
P(x)  is  fixed  and  the  sensed  image  contains  a  copy  of  P(x>. 
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The  probability  distribution  of  the  random  variable  is 
difficult  to  compute  so  we  must  mane  an  assumption  to  estimate 
properties  of  the  distribution.  First  we  assume  that  the 

probability  oistrioution  of  CNlt)  is  independent  of  t.  This  is 

not  correct  since  the  area  of  overlap  between  the  reference  image 
anu  the  translated  noise  image  varies.  As  the  overlap  shrinks  to 
zero,  the  probability  that  C(t)  =  0  goes  to  one.  To  bound  the 
registration  error*  we  t a k e  the  opposite  extreme  and  assume 
complete  overlap  between  the  nolse  and  the  image.  This 
assumption  greatly  increases  the  probability  of  false 
registration  so  the  real  probability  of  false  registration  is 
much  lower  than  that  given  here. 

The  usefulness  of  probabilistic  properties  of  Clt)  requires 
some  comment.  At  the  correct  transformation*  t  »  each  reference 
image  point  lines  up  with  sensed  image  point  s  so  Cl*>  >  *  n  where 
n  is  the  number  of  points  tn  the  reference  image.  Far  from  t  » 

at  most  a  single  reference  point  can  line  up  with  the  image  of  a 

reference  point.  Thus  we  have 

Clt)  *  CNlt) 

or  Clt)  *  CNlt)+1 

depending  on  whether  or  not.  a  reference  point  lines  up.  A  false 
registration  occurs  if 

Cl*,  )  *  Clt) 

for  some  t  far  from  10.  Thus  a  fa.lse  registration  cannot  occur 
unless  for  some  t  far  from  tg, 

CNlt)  >  n-1 

in  light  of  th*  ab0ve  remarks*  we  see  that  an  upper  bound  on 
the  probability  of  false  registration  is  given  by  the  probability 
of  the  following  event: 


Max  CN(t>  >  n-1 


E : 

We  know  of  no  way  to  effectively  compute  the  probability  of  this 
event.  In  the  analysis  of  correlation  m0deis  for  Gaussian 
images  t  a  similar  problem  arises  and  the  approach  taken  is  to 
assume  that  the  probability  distribution  of  C(t)?  for  fixed  t, 
can  be  used  to  estimate  this  probability  of  the  event  E.  This 
formulation  assumes  that  the  values  of  C(t)  for  various  t's  in  a 
given  picture  can  oe  thought  of  as  samples  generated  according  to 
the  aistrioution  of  C(t)  tor  fixed  t  but  varying  over  the 
ensemole  of  noisy  pictures. 

Tnis  type  of  hypothesis?  referred  to  as  an  ergodic 
hypothesis,  is  common  in  iaage  processing  models.  If  we  further 
assume  a  functional  fora  for  the  aistribution  of  C(t),  then  we 
need  only  estimate  the  parameters  of  the  functional  form  to 
estimate  the  probability  of  false  regi stration.  In  the  Gaussian 
image  case,  C(t)  is  assumed  to  be  Gaussian,  so  only  the  first  and 
second  moments  need  be  estimated.  For  the  discrete  case  we  can 
bypass  the  assumptions  of  a  functional  form  and  directly  compute 
the  probability,  q,  of  the  following  event: 

F:  CN(t)  >  n-1 

If  w  is  the  number  of  pixels  in  the  image?  then  the  probability 
of  false  registration,  f,  is  given  by  (1-q)w. 

The  computation  of  the  probability  d  is  stra^ghtf orwarg*  We 
assume  the  reference  contains  n  points*  Assume  t h*  noisy  image 
contains  m  points.  Recall  that  CN<t>  is  a  random  variable  wh0se 
ensemble  is  formed  in  the  following  way.  Generate  a  noisy 
picture  and  shift  it  t  with  respect  to  the  reference  image.  Then 
count  the  number  of  pixels  which  contain  both  a  reference  point 
and  a  noise  point.  This  count  is  an  element  of  the  ensemble,  as 
discussed  earlier,  to  obtain  an  upper  bound  on  the  probability  of 
false  registration  we  omit  the  translation  and  assume  thr  two 
Images  overlap  completely. 

Referring  to  each  pixel  as  a  bin?  we  may  state  our  task  as  a 
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simple  discrete  probability  problem.  We  are  given  v  bins*  n  of 
which  are  occupied.  We  then  take  m  objects  and  place  each  one  in 
a  aifferent  bin,  selecting  the  bins  using  a  uniform  random  number 
generator.  The  probability  we  are  interested  in,  q,  is  the 
probability  that  at  least  n-1  bins  contains  both  one  of  the 
original  n  oojects  and  one  of  the  m  objects.  This  is  given  by 


(?-)  (J(n-l)  +  t) 


For  a  10G  x  100  image  with  five  reference  points  and  a  sensed 
image  with  25  noisy  points,  we  have  q  =  1.5  x  10~U  •  in  this 
case,  F,  the  probability  of  false  registration,  is  .0003. 

To  make  our  model  more  realistic  we  may  assume  the  reference 
image  contains  noise  points.  By  noise  points  we  mean  points 
which  represent  ground  features  but  which  fail  to  show  up  in  the 
sensed  io>age.  Keeping  the  notation  of  the  previous  problem,  we 
now  let  k  denote  the  number  of  noise  points  in  the  reference 
image.  with  k  noise  points,  Clt)  can  be  as  large  as  min  (n+k , 
m).  be  shall  assume  m  >  n+k.  yhe  correlation  of  the  current 
transformation  can  now  range  from  n  to  n+k.  we  shall  assume 
C(tg  )=n,  resulting  in  our  calculat ions ,  providing  an  even  greater 
overestimate  of  the  probability  of  false  regi s trat ion. 

We  are  still  interested  in  computing  the  probability  q,  but 
now  the  number  of  bins  occupied  has  risen  from  n  to  n+k.  Thus  q 
is  given  by 

Si  (w~+k) 

For  n,  m,  and  w,  as  in  the  previous  example,  and  k  «  5*  we  obtain 
q  *  1*2  x  1C~*  *  For  this  value  of  q  we  have  f  *  .0122.  The 
above  examples  show  the  extremely  high  reliability  of  discrete 
correlation  under  the  stated  assumptions.  A  10o  x  100  image  was 
selected  to  allow  for  a  coarse  resampling  of  a  typical  5I2  image 
as  suggested  in  Section  4.2*  Thus  approximate  matching  in  the 
finer  image  is  replaced  by  exact  matching  in  the  coarser  image. 
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ah  analysis  of  the  exoected  effect  of  this  coarse  sampling  on  the 
cros  s-co  r  re  lat  i  on  of  the  reference  image  with  the  non-~noisy 
position  of  the  sensed  image  would  be  useful  since  a  coarse 
sampling  could  reduce  C(t0  >  below  n« 


4.4  jparse  Correlation 


Various  aspects  Of  the  relationship  between  the  LNK 
registration  procedure  and  feature  point  correlation  are 
discussed  in  Section  4.2.  An  analytical  model  of  feature  point 
correlation  is  given  in  Section  4.3.  This  section  describes 
several  experiments  designed  to  estimate  the  registration  error 
in  discrete  correlation. 

Simulations  were  performed  on  one-  and  two-dimensional  data. 
The  image  was  represented  as  a  vector  of  length  1 qo  in  the 
one-di mens iona l  case  and  a  64  x  64  array  in  the  two-dimensional 
case.  A  set  of  points*  cabled  map  points*  was  then  distributed 
into  the  pixels  using  a  uniform  random  number  generator.  *  copy* 
called  the  image*  of  the  vector  or  array  was  made  and  additional 
points  called  noise  points  were  thrown  into  these  bins.  The 
c ross-corre lat ion  between  the  image  and  the  map  was  then 
computed.  For  each  such  correlation  the  value  of  the 
eross-corre lat ion  function  at  (he  peak  was  recorded  in  addition 
to  the  correctness  of  the  offset  corresponding  to  this  peak. 

The  problem  of  multiple'  points  in  bins  was  handled  slightly 
differently  in  the  one-  and  two-dimensional  cases.  In  the 
one-di mens i ona l  case  a  bin  assumed  the  value  of  one  if  one  or 
more  points  landed  in  it.  In  the  two-dimensional  case  the  value 
of  the  bin  was  the  number  of  points  contained  in  it.  The  former 
procedure  is  preferable  since  it  gives  a  better  approximation  to 
the  number  of  matches.  The  minimum  of  the  numbers  of  points  in 
the  map  and  image  bins  would  actually  be  the  best  measure  of  how 
well  two  bins  coincide*  but  for  speed  we  shoose  the  coarser 
method  in  the  two-dimensional  case,  we  think  the  differences 
between  these  schemes  ar.e  not  sufficient  to  cause  great 
differences  in  correlation  performance*  but  this  question  could 
be  resolved  with  a  modest  analytical  and  simulation  effort. 

The  results  for  the  one-dimensional  correlation  are  given  in 
Table  4*i*  Note  that  in  all  cases  the  peak  of  the 


Table  4.1  •  fine-d imens iona l  unno rma li zed  correlation  on  a  binary 
picture  of  length  -jQO  with  *  common  points  ana  N  noisy  points 
present*  The  correct  transformation  was  always  at  the  highest 
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c ro s s- co r re  l  a t i on  corresponds  to  toe  correct  transformation.  One 
purpose  in  investigating  the  one-aimensional  case  was  to  explore 
the  possibility  that  the  evaluation  of  a  transformation  in  the 
LMK  procedure  could  be  accelerated  by  performing  the  matching  on 
a  horizontal  or  vertical  projection  of  the  points  rather  than 
working  with  the  points  .in  two  spaces.  Since  the  registrations 
were  correct  in  all  cases*  we  have  no  indication  as  to  the 
limitations  of  correlation  in  one  dimension. 

The  results  of  the  two-dimensional  correlation  study  are 
shown  in  Table  4.2.  Matching  was  remarkably  accurate.  yith  as 

t 

few  as  three  common  points*  several  hundred  noise  points  were 
present  before  an  incorrect  transformation  was  selected,  with 
five  common  points  an  incorrect  transformation  was  never 
selected.  An  additional  Sqq  trials  were  run  in  which  the  map 
always  contained  five  points  and  the  image  had  25  additional 
noise  points.  In  the  Sqq  trials  the  current  transformation  was 
always  at  the  correlation  peak. 


Table  4.2  •  Two -dimensional  unnormalized  cross-correlation 
between  two  64  x  64  images.  The  value  of  a  pixel  is  the  number 
of  points  lying  within  the  pixels.  N  points  are  common  to  both 
images  and  N  noise  points  are  added  to  the  second  image. 

CORRECT  *  T  if  the  true  transf ormat ion  corresponds  to  the  peak  of 
the  correlation  function;  ft  otherwise 
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4.5  Distance  Matching 


The  speed  of  the  LMK  registration  procedure  is  dependent 
upon  its  ability  to  eliminate  as  Many  RT  transformations  as 
possible  as  soon  as  possible.  The  first  screening  is  to  check  to 
see  if  an  image  vector  and  a  nap  vector  natch  in  length  before 
examining  a  t ransf ornat ion  sending  one  into  the  other.  For 
evaluating  the  remainder  of  the  procedure*  it  is  useful  to  have 
knowledge  of  the  number  of  matches  that  are  likely  to  be  found  at 
this  first  stage.  This  section  describes  work  done  in  estimating 
the  distribution  of  vector  lengths  for  point  images  and  the 
number  of  matches. 

Ideally*  we  would  like  to  know  the  probability  distribution 
for  the  set  of  interpoint  distances  between  n  points  uniformly 
distributed  over  a  square.  This  could  aid  greatly  in  the 
selection  of  nap  vectors.  By  selecting  those  map  vectors  which 
have  lengths  less  likely  to  occur  in  a  random  set  of  points*  we 
nay  greatly  reduce  the  number  of  transf creations  to  be 
considered.  Intuitively*  it  is  clear  that  very  long  and  very 
short  vectors  are  unlikely  to  occur  in  a  point  image*  but  this 
observation  is  not  adequate  for  designing  an  abstract  vector 
selection  procedure. 

The  selection  of  map  vectors  requires  consideration  of 
several  conflicting  goals.  First*  the  number  of  map  vectors 
should  be  small  to  reduce  computation.  Second*  the  number  of  map 
vectors  should  be  large  enough  to  find  an  adequate  number  of 
correct  correspondences  between  image  vectors  and  nap  vectors. 
Third*  the  number  of  image  vectors  should  be  as  small  as 
possible.  Some  sampling  of  feature  points  or  abstract  vectors* 
as  described  in  Section  4.7*  is  usually  required  since  the  number 
of  possible  length  comparisons  made  in  initial  screening  goes  up 
as  nz  a2  where  n  and  n  are  the  number  of  image  and  nap  points 
respectively. 
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Various  aspects  of  the  vector  selection  problem  are  treated 
in  other  sections  of  this  report*  Section  4*8  describes  several 
schemes  for  using  labelling  of  points  to  select  vectors*  while 
Section  4*7  describes  probabilistic  considerations  in  randomly 
selecting  vectors  froa  a  feasible  set  of  vectors*  The  reaainder 
of  this  section  treats  the  role  of  length  in  the  selection  of  aap 
vectors* 

Attempts  to  derive  an  analytical  expression  for  the 
probability  of  various  interpoint  distances  occurring  or  the 
expected  number  of  times  a  range  of  distances  occurs  have  not 
been  successful*  The  basic  problea  in  the  analysis  appears  to  be 
the  tremendous  amount  of  dependency  among  the  interpoint 
distances*  Zn  general*  when  a  new  point  is  added  to  the  picture* 
its  location  is  determined  by  its  distance  froa  three  points* 
thus  making  all  other  distances  a  function  of  these  three*  The 
probablity  distribution  for  the  distance  between  two  points  in 
the  square  can  be  easily  calclated  by  treating  the  horizontal  and 
vertical  distances  independently  *  but  this  sheds  little  light  on 
the  general  case.  There  exists  a  large  amount  of  literature  on 
the  subject  of  the  distribution  of  random  geometric  objects 
CSantalo  19763*  but  we  haven't  yet  been  able  to  utilize  any  of 
this  work* 

The  distribution  of  interpoint  distances  for  an  image 
containing  25  points  was  estimated  using  simulation*  One  hundred 
pictures  were  generated*  each  occupying  a  512-by-512  square*  The 
distribution  of  the  interpoint  distances  is  given  in  Table  4*3* 
As  expected*  the  distribution  is  unimodel*  This  table  can  be 
used  to  obtain  a  rough  estimate  for  the  expected  number  of 
matches  between  a  map  vector  of  specified  length  and  the  image* 

The  distribution  in  Table  4*3  allows  only  a  rough 
approxmation  in  computing  the  expected  number  of  matches  between 
an  image  and  a  map*  A  study  was  performed  to  directly  estimate 
the  number  of  matches  based  solely  on  edge  length*  The 
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Table  4.3 
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Distribution  of  interpoint  distances  for  25  randoaly 


distributed  points  in  a  51Z  x  512  iaage. 
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experiment  consisted  of  25  trials*  In  each  trial  a  512  x  512  map 
consisting  of  five  uniformly  distributed  map  points  was  created* 
The  nap  was  then  rotated  5  degrees*  with  a  shift  of  IQ  pixels  in 
the  x  direction  and  20  in  the  y  direction*  Nap  points  were 
slightly  perturbed  by  adding  random  perturbations  to  the  i  and  y 
coordinates  of  each  point*  The  perturbations  are  generated  from 
a  Gaussian  distribution  of  mean  aero  and  variance  one*  Varying 
numbers  of  noise  points  were  added*  ranging  from  fifteen  to 
sixty*  All  interpoint  distances  among  the  image  points  and  among 
the  map  points  were  used*  For  each  trial*  each  image  interpoint 
distance  which  was  within  ten  pixels  of  a  nap  interpoint  distance 
contributed  one  to  the  total  number  of  transformations* 

In  each  trial  at  least  ten  of  the  transformations  arose  from 
the  correct  correspondences  between  map  edges*  Table  4*4 
provides  a  rough  estimate  of  the  size  of  the  registration 
problem*  Based  upon  the  number  of  transf oraations  observed  in 
this  experiment*  we  think  that  if  clustering  is  to  be  eliminated* 
a  much  more  selective  screening  process  is  required*  such  as  the 
point  labelling  described  in  Section  4*8*  These  simulations 
allow  us  to  torn  probabilistic  estimates  for  the  performance  of 
abstract  vector  selection*  In  Section  4*7  the  vector  sampling 
problem  was  considered  from  the  viewpoint  of  determining  the 
probability  of  selection  of  an  adequate  number  of  correctly 
matching  vectors*  Table  4*4  provides  an  estimate  of  the  total 
number  of  transformat  ions  to  be  considered* 

Based  on  Table  4*4  and  the  assumption  that  very  tong  vectors 
are  more  stable  than  very  short  ones*  an  apparently  reasonable 
vector  sampling  scheme  would  be  to  choose  only  tong  vectors* 
Unfortunately  *  this  scheme  fails  to  take  into  account  the 
relative  size  and  overlap  of  map  and  image*  If  the  map  is  much 
larger  than  the  image*  then  the  longer  map  vectors  won't  even  fit 
in  the  image*  Similarly*  if  the  overlap  between  nap  and  image  Is 
much  less  than  the  image  size*  then  the  maximal  length  of  a 
matching  vector  is  reduced*  If  the  image  and  map  are  both  square 
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Table  4.4  •  Number  of  image  edges  which  agree  in  length  within 
t<»n  of  a  nap  edge.  The  "'aDS  contain  five  randomly  generated 
points  and  the  image  contains  these  five  plus  N  additional  noise 
points. 
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and  one  is  horizontally  shifted  half  an  iaage  length  with  resoeet 
to  the  otherv  then  the  lenth  of  a  aaxiaal  vector  is  reduced  by  20 
percent. 

Suaaing  up  the  aerits  of  various  types  of  vectors,  we  aay 
aake  the  following  reaarks*  Short  vectors  are  good  because  they 
tend  to  have  fewer  spurious  Batches  but  bad  because  saall  changes 
in  their  endpoints  can  lead  to  large  changes  in  rotation  angle* 
nediua-length  vectors  are  better  than  short  vectors  with  regard 
to  the  rotation  problea  but  worse  with  regard  to  the  nuaber  of 
spurious  Batches  and  the  possibilities  of  lying  outside  the 
iaage-aap  intersection.  finally,  long  vectors  are  aost 
insensitive  to  endpoint  changes,  unlikely  to  produce  aany  false 
Batches  but  aore  likely  to  not  lie  in  the  iaage~aap  intersection* 

we  would  like  to  quantify  the  above  ideas  so  a  aerit  could 
be  assigned  to  each  vector*  Selection  of  vectors  could  then  be 
based  on  this  aerit*  Two  of  the  three  factors  can  be  easily 
quantified*  The  effect  of  vector  length  on  rotation  angle  could 
be  easily  siaulated*  The  nuaber  of  possible  Batches  can  be 
estiaated  using  Table  4.4  and  siailar  coaputations  with  varying 
nuabers  of  points*  Siaulations  can  also  be  perforaed  to  estiaate 
the  effect  of  vector  length  on  the  vector  lying  outside  the  iaage 
aap  overlap*  Assuaptions  on  the  type  of  overlap  between  iaage 
and  aap  aay  affect  this  estiaation  considerably*  If  the  only 
assuaption  on  overlap  is  the  approsiaate  aaount  of  coaaon  area, 
the  results  aay  vary  greatly  with  orientation*  We  have  just 
begun  study  on  these  questions* 

A  further  problea  in  the  evaluation  of  vectors  arises  if  the 
aerits  canot  be  considered  independently*  There  are  at  least  two 
ways  in  which  we  aay  have  Interaction  between  the  selected 
vectors*  first,  points  shared  between  vectors  result  in  fewer 
points  being  represented  by  the  vectors*  Second,  the 
distribution  of  vectors  in  the  iaage  is  iaportant*  If  aost 
vectors  selected  are  in  one  part  of  the  iaage,  we  aay  fail  to  get 
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enough  vectors  in  the  aap-iaage  intersect  ion*  Other  types  of 
dependence  aay  be  present*  In  spite  of  these  dependencies*  ue 
think  that  the  independence  assuaption  aay  be  reasonable  as  a 
starting  point  for  defining  a  aerit  function  for  vectors* 


4.o  Empirical  Registration  Accuracy 

This  section  describes  the  results  of  simulation  studies  in 
which  the  LNK  procedure  was  applied  to  abstract  vectors  formed 
from  randomly  generated  point  images.  we  were  interested  in 
analysing  the  deterioration  in  registration  accuracy  as  the 
number  of  non-corr espond ing  points  between  images  is  increased. 
The  results  of  this  section  served  to  bring  out  the  need  for  the 
study  of  sampling  and  labelling  procedures  as  described'  in 
Section  4  of  this  report. 

The  artificial  images  used  in  this  study  were  of  two  types. 
First*  a  point  image*  called  a  map  and  consisting  of  five  to  ten 
points  was  generated*  where  a  uniform  random  number  generator  was 
used  to  generate  the  x  and  y  coordinates  of  the  points.  Then  a 
perturbed  c0py*  called  the  image*  of  this  map  was  formed  by 
rotating  the.  image  five  de<grees*  and  translating  it  t*n  units  in 
the  x  direction  and  twenty  units  in  the  y  direction.  In 
addition*  the  x  and  y  coordinates  of  each  point  were  perturbed  by 
adding  independent  random  numbers  to  the  x  and.  y  coordinates 
where  the  random  numbers  were  generated  by  a  zero  mean  Gaussian 
process*  with  varying  variances.  Then  a  set  of  additional 
points*  called  noise  points  was  placed  in  the  image  by  uniformly 
generating  x  and  y  coordinates. 

The  LNK  registration  procedure  requires  abstract  vectors  to 
be  formed  from  the  feature  points  sQ  two  sample  schemes  were 
selected.  In  one  scheme  all  pairs  of  points  were  selected  as 
abstract  vectors  while  in  the  other  scheme  pairs  of  points  were 
randomly  selected.  For  each  pair  of  points  chosen*  one  was 
randomly  chosen  to  be  the  head  of  the  vector  and  the  other  to  be 
the  tail. 

The  match  weight  of  a  transf ormat ion  was  the  measure  of 
registration  quality  usied  for  determining  the  registration 
transf ormat ion.  It  weighs  both  the  average  separation  between 
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the  vectors  and  their  lengths.  Though  this  measure  of 
registration  quality  is  not.  ideal,  it  provided  sufficient  insight 
into  registration  accuracy  for  the  present  study.  for  further 
information  on  match  weights,  see  CStockman  19813. 

The  results  of  the  simulations  are  given  in  Table  4.5.  The 
success  of  a  given  trial,  is  determinea  by  looking  at  th*  ®»  x» 
and  y  columns  of  the  line  in  the  table  corresponding  to  the 
trial.  For  a  perfect  regi-st  ra  t  i  on ,  these  variables  should  assume 
the  values  5t  10,  and  2q  respect ive  ly .  These  variables  represent 
the  rotation  and  translation  corresponding  to  the  highest  match 
weight  t rans f ormat ion  examined  by  the  clustering  program,  where 
the  weight  is  given  in  column  wyP.  The  match  weight  of  the 
correct  transformation  is  given  in  Column  uTT.  In  cases  7,  8» 
33,  39,  41,  42,  and  46,  the  highest  match,  weight  transformat  ion 
Considered  by  the  clustering  routine  was  far  from  the  correct 
transf ormat ion,  but  the  match  weight  at  this  transformation  was 
significantly  lower  than  the  match  weight  at  the  correct 
transf ormat ion.  For  several  of  these  cases  detailed  examination 
of  the  program  output  led  to  the  conclusion  that  the  correct 
t ransf ormat ion  was  not  near  any  of  the  clusters  examined  by  the 
clustering  program. 

The  primary  problem  w.ith  the  clustering  program  appears  to 
be  in  the  way  in  which  bins  are  formed  in  the  search  for 
clusters.  To  guarantee  the  detection  of  cluster  peaks,  bins  must 
be  overlapped.  Otherwise,  a  cluster  can  be  spaced  over  several 
bins  without  enough  of  a  concentration  in  any  single  bin  to  be 
singled  out  as  a  potential  cluster  center,  ^he  cost  of  providing 
enough  bins  to  prevent  this  overlap  problem  can  be  high, 
especially  when  scaling  is  added  to  the  problem.  The 
consideration  of  more  clusters  could  be  obtained  by  lowering  the 
standards  for  a  cluster  peak,  but  this  also  adds  considerable 
cost  to  the  algorithm. 

In  addition  to  the  registration  failures  mentioned  above* 
incorrect  regi strat ions  were  obtained  in  cases  11,  12,  32,  33, 
34,.  35,  37,  44,  45,  and  47.  The  match  weight  at  the  correct 
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t r^ns format  ion  in  case  11  was  1990  while  the  next  highest 
matcnwei3ht  a*  the  c0rrect  transformation  was  917.  For  those 
cases  in  which  an  approximately  corpect  transf ormat ion  was 
predicted  by  the  program*  the  lowest  match  weight  at  the  correct 
transformation  was  1246*  thus  registration  failures  appeared  to. 
be  arising  from  two  problems.  First*  the  clustering  routine  may 
fail  to  label  the  correct  transformation  as  being  near  a  cluster 
peak*  even  though  the  matcJr  weight  at  that  transformation  is 
high.  Second*  the  match  weight  at  the  true  transformation  may  be 
low*  resulting  in  failure  to  recognize  this  as  a  cluster  peak. 

The  significance  of  the  error  sources  requires  further 
comment.  Only  in  1  case  out  of  47  did  an  error  occur  in  which 
the  match  weight  at  the  correct  transformation  was  substantial* 
say  above  10G0*  and  the  match  weight  at  the  best  transformation 
predicted  by  the  algorithm  was  larger.  Apart  from  errors  due  to 
the  clustering,  it  appears  that  errors  are  due  more  to  fewer 
correct  vectors  matching  than  to  too  many  incorrect  vector's 
matching. 

Further  corroboration  of  th«  hypothesis  that  a  laek  of  good 
matches  rather  than  too  many  bad  matches  creates  errors  nay  be 
founa  by  looking  at  the  match  weights  at  the  predicted 
registration  transformations  in  the  cases  where  an  error  occurred 
which  was  not  due  to  clustering.  Excluding  case  11*  because  its 
match  weight  at  the  correct  transf ormat ion  was  high*  the  highest 
match  weight  in  the  remaining  such  error  cases  was  1521  at  the 
predicted  transformation.  This  was  higher  than  the  correct  match 
weight  a?  the  true  trans format i on  for  only  five  of  the  thirty 
correct  registration  cases* 

By  setting  a  minimum  required  match  weight  of  2000  to  accept 
a  predicted  transf ormation*  we  obtain  25  successful 
regist rat  ions t  1  incorrect  registration*  and  26  regi strat ions 
rejected  due  to  inadequate  match  weight.  if  we  had  instead 
always  used  the  predicted  transformation  without  regard  for  a 
minimum  match  weight,  then  30  correct  regi strat ions  and  1? 
incorrect  regi strations  would  have  been  obtained,  these  figures 
are  used  to  indicate  the  level  of.  Improvement  available  in 
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registration  if  we  are  hilling  to  reject  registrations.  The 
actual  percentages  of  correct  and  incorrect  registrations  do  not 
reflect  on  the  algorithm  since  we  chose  values  for  the  numoer  of 
map  ano  image  points  to  provide  data  on  how  the  algorithm  behaves 
near  parameter  values  where  it  breaks  down. 

The  tradeoff  between  algorithm  accuracy  and  the  number  of 
noise  and  map  points  present  is  difficult  to  describe  with 
precision.  Using  'the  simple  random  selection  of  edgest  the 
program  generally  produced  correct  results  as  long  as  the  number 
of  noise  points  was  no  more  than  2.5  times  the  number  of  map 
points.  This  is  a  very  rough  guideline  which  we  did  not  feel 
merited  further  empirical  examination  until  a  better  analytical 
understanding  of  t rans f ormat ion  screening  was  prformed.  As  a 
result  of  the  studies  aescribed  in  Sections  A. 7  -  4.11,  we  feel 
that  this  factor  of  2.5  can  be  improved  considerably. 

The  cases  in  which  registration  failure  occurred  are  due 
primarily  to  an  inadequate  number  of  correct  image  vectors  being 
Tiven  to  the  registration  algorithm.  Our  studies  in  Sections  4.7 
-  4.11  indicate  that  a  much  larger  number  of  correct  image 
vectors  can  be  given  to  the  registration  program  if  more 
pre-screening  is  performed.  Including  point  labelling  and 
matching  of  triples  of  points.  The  former  procedures  require 
refinements  to  the  feature  selection  process  while  the  latter  can 
be  used  currently. 

The  cases  studied  in  this  section  presided  considerably  more 
data  than  is  given  in  Table  4.5.  For  each  case  the  match  weights 
are  given  for  several  cluster  peaks  for  several  levels  of 
clustering.  This  more  detailed  information  will  prove  useful  in 
setting  reasonable  criteria  for  accepting  or  rejecting  a 
registration  and  for  more  detailed  study  of  clustering  if  it  is 
to  be  retained  in  the  algorithm. 

we  have  dealt  only  with  the  factors  involved  in  getting  an 
approximately  correct  reg.i  strat  ion.  A  treatment  of  registration 
accuracy  given  that  an  approximate  registration  has  been  obtained 
involves  very  difficult  methods  of  analysis,  ue  did  not  think  it 
reasonable  to  examine  this  question  until  a  better  understanding 
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of  approximate  accuracy  was  obtained. 

In  summary,  tne  lnk  proceaure  is  very  robust  with  respect  to 
the  number  of  erroneous  image  vectors  as  long  as  an  adequate 
number  of  correct  vectors  are  present.  Using  somewhat  loose 
terminology,  it  is  far  more  important  in  obtaining  a  good 
registration  to  nave  a  correct  signal  having  at  least  a  certain 
minimum  strength  than  it  is  to  have  a  high  signa l-to-noi se  ratio* 
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Table  4.5  Registration  results  using  LNK  procedure  on  a  map  with  CP  points 

|  ana  an  image  *ith  CP+ftP  points  where  the  correct  t ransforma t ion  is  *5t 

*  =  1C»  y * 2 u .  The  image  contains  a  perturbed  copy  of  the  map  with  variance  V 
I  (see  text)  ana  NP  additional  noise  points.  IE  and  ME  are  tne  number  of 
*  image  and  map  vectors  usea.  A  denotes  all  vectors.  *Tt  is  the  match 

I  weight  at  the  correct  transformation  and  »iTP  is  the  the  predicted 

t ransf ormat ion  parameters.  The  remaining  columns  are  not  relevant  at 
,  pre  sent • 
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4.7  Vector  Sampling  for  Registration 


The  LNK  registration  procedure  does  not  currently  provide  a 
method  for  the  selection  of  abstract  vectors  from  feature  points. 
As  shown  in  Section  4.8*  labelling  of  point  features  can  lead  to 
a  significant  reduction  in  the  number  of  possible  abstract 
vectors.  In  spite  of  this  reduction*  it  may  not  be  feasible  to 
examine  the  full  set  of  t ransf ormat ions  determined  by  all  pairs 
of  matching  vectors.  Several  alternatives  to  the  examination  of 
all  transformations  are  possible.  First*  a  clustering  procedure 
can  be  applied  to  the  set  of  transformations  in  order  to  locate 
transf ormations  which  are  supported  by  many  matching  vectors. 
Second*  a  subset  of  vectors  may  be  randomly  selected  and  all 
possible  transformtions  determined  by  those  vectors  may  be 
examined.  This  section  treats  some  of  the  combinatorial  problems 
associated  with  these  schemes  in  order  to  aid  in  the  design  of 
procedures  for  selecting  trans f ormations . 

The  clustering  procedure  currently  used  in  the  LNK  scheme 
has  several  severe  limitations*  described  in  Section  5.  Apart 
from  the  possibility  of  totally  missing  the  correct  cluster*  as 
occurred  in  several  experiments*  the  clustering  procedure  is  slow 
enough  to  cause  a  bottleneck  in  the  registration  process  if  a 
large  number  of  abstract  vectors  are  used.  The  experiments 
described  in  Section  4,5  show  that  the  number  of  feasible 
transf ormations  can  grow  quite  rapidly  with  increases  in  the 
number  of  noise  points.  T^e  clustering  of  this  many 
transf ormations  puts  a  heavy  computational  burden  on  the 
registration  process. 

The  empirical  determination  of  the  histogram  of  interpoint 
distances  for  randomly  distributed  points  shows  that  the  lengths 
appear  to  have  a  unioodal  distribution.  Thus*  if  a  small  set  of 
points  is  selected  from  the  map,  the  vector  lengths  will  tend  to 
be  clumped  around  the  mean  and  many  image  vectors  will  tend  to 
match.  On  the  other  hand*  if  very  long  or  very  short  image 
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vectors  are  selected*  very  fen  matches  are  likely  to  be  present* 
Further  experimentation  should  be  done  to  determine  the  reduction 
in  the  number  0f  patches  when  l°n9  vectors  are  selected* 

A  variety  of  dete rm ini s t i c  schemes  can  be  used  to  limit  the 
number  of  vectors  to  be  used  for  matching.  These  procedures* 
including  selection  of  long  edges  and  labelling  of  point 
features*  may  reduce  the  number  of  vectors  significantly*  but  a 
large  number  of  vectors  may  remain  even  after  this  screening*  In 
addition*  the  screening  may  be  expensive,  especially  in  the  case 
of  point  feature  labelling*  If  the  number  of  vectors  remaining 
is  still  too  large  to  use*  some  random  sampling  of  the  vectors 
must  be  made*  Alternating  random  sampling  of  the  points  could  be 
done  • 

Each  procedure  for  screening  vectors  leads  to  a  different 
sampling  problem.  Only  a  few  of  the  problms  have  been  examined 
to  date*  The  basic  goal  of  our  sampling  procedure  should  be  to 
obtain  a  set  of  vectors  adequate  for  an  accurate  regi stration. 
Since  an  answer  to  the  Question  wou|d  requfre  a  deeper  analysis 
of  registration  accuracy  than  is  currently  available*  we  treat  a 
simpler  problem:  how  small  a  random  sample  of  image  vectors  can 
be  selected  which  will*  with  high  probability*  contain  at  least  a 
specified  number  of  map  vectors*  1*e*  image  vectors  corresponding 
to  map  vectors? 

we  now  describe  the  sample  problem  for  a  map  containing  f 
points  and  an  image  containing  v  points*  where  the  nap  points  are 
also  in  the  image.  The  number  of  map  vectors  is  f(f-1>*  since 
any  map  point  may  be  chosen  as  the  tail  of  a  map  vector  and  any 
of  the  remaining  points  may  be  chosen  as  the  head*  Similarly* 
the  i*a9*  contains  v<v-1)  vectors*  Many  of  these  image  vectors 
will  not  natch  with  any  map  vectors*  Let  p  be  the  number  of 
image  vectors  matching  with  at  least  one  nap  vector.  If  all  nap 
vectors  are  used*  then  k  of  P  vectors  represent  correct 
matches* 
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Several  sampling  procedures  can  be  applied  at  this  point* 
The  simplest  is  to  select  a  fixed  number  of  the  p  feasible  imaoe 
vectors*  A  more  sophisticated  procedure  would  be  to  associate  to 
each  mao  vector  the  set  of  image  vectors  associated  with  it  and 
to  sample  from  each  of  these  sets*  The  latter  procedure  appears 
promising*  for  if  several  of  the  sets  are  very  small*  they  may  be 
sampled  exhaustively*  thereby  guaranteeing  that  a  lower  bound  on 
the  number  of  correct  image  vectors  present  is  the  number  of 
small  sets* 

Both  of  the  above  sampling  procedures  are  analytically 
tractable*  For  the  first  procedure  we  randomly  select  a  subset 
containing  n  of  the  p  vectors*  Assume  we  wish  to  choose  n  so 
that  one  set  will  contain  at  least  s  map  vectors*  The 
-^probability  of  getting  exactly  t  nap  vectors  for  any  0  t  m  is 
easily  calculated  by  observing  that  a  sample  of  r  vectors 
containing  t  map  vectors  contains  r  -  t  image  vectors  which  are 
not  also  in  the  nap*  Since  there  are  a  total  of  n  mao  vectors* 
the  t  map  vectors  can  be  chosen  in  (  “)  ways*  Sinilary*  the 
non-nap  vectors  can  be  chosen  in  (  ?”?  >  ways*  As  there  are  (£) 
ways  to  choose  the  r  vectors*  the  probability  of  picking  exactly 
t  nap  vectors  is  given  by 

,<«>  -(fc) 

(?) 

where  oCx»t)  denotes  the  probability  of  picking  exactly  t  nap 
vectors* 

The  probability  density  p(x*t)  is  just  the  probability 
density  function  for  the  hyoergeonctric  distribution  with 
parameter  r*  m*  and  p  CJohnson  and  Katz*  19693*  To  denote  the 
explicit  dependence  of  p(x*t>  on  the  parameters  of  the 
distribution*  we  denote  p(x*t>  by  p(tf  r*m*p)*  Since  we  wish  to 
obtain  at  teases  map  vectors,  we  should  choose  r  so  that 
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o(t  r.a.o)  >  a 


where  b  *  «in  (a.r)  and  is  the  desired  confidence  level  and 
H(s;r.a.o)  is  the  probability  of  obtaining  at  least  s  aap 
vectors.  Noting  that  the  saae  hyoerqeoaetric  distribution  is 
obtained  if  r  and  a  are  interchanged.  a  nuaber  of  approsiaation 
orocedures  for  N(s;r.a.pl  aay  be  applied. 

The  quality  of  the  aopraxiaations  to  H(s;r«a.p)  depends  uoon 
the  relationship  between  the  paraaeters  of  the  distribution. 
Before  describing  these  approxiaat ions.  we  stress  that  the  point 
in  evaluating  H(s;r.a.p)  is  to  find  the  saallest  r  such  that 
hCs; r»a.e)> a •  Since  aopraxiaations  are  noraaly  given  in  teras 
of  M(ssr.a.p)  *  1-H(s;r.a.p) .  our  problea  is  to  find  the  saallest 
r  such  that  H(s;r.a.p)<1  -a.  We  know  of  no  analytical  procedure 
to  find  the  saallest  suitable  r.  Using  the  fact  that  M  is  a 
aonotonically  increasing  function  of  r»  a  siaple  binary  search 
for  the  saallest  r  is  possible. 

we  do  not  yet  have  coaputational  experience  with 
approxiaations  to  the  hyoerqeoaetric  function,  so  we  cannot  aake 
recoaaendations  as  to  a  suitable  choice  at  this  tiae.  The 
following  approxiaat ions  appear  potentially  relevant  CJohnson  and 
kotz  19693.  For  a  >  r.  which  can  always  be  obtained  (see  note 
followina  the  definition  of  M).  the  following  approsiaation  has 
been  constructed: 

H  Cs)  •  Z  p(t  |r,a,p)  Z  (r)(m/p>t  (l-CM/p))1"* 

t-0  t-0  v ' 

1  r (r-1) (a/p) (1-a/p)  (^+2)  (®/p) 8+2 (1-n/p Jr~*"2 

“  (a/p)*+1(l-a/p)r"8"1 

A  second  approsiaation  is  given  by 
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Z 

t-0 


f£>uJ  (l-*>  r“J 


where  w  -  (a-1/2s) /Cp-1/2r  ♦  tl/2). 

When  nore  understanding  of  the  ranqe  of  parameter  values 
occurrina  in  real  data  has  been  obtained*  we  can  make  a  decision 
as  to  whether  r  sould  be  coeputed  or  deterained  by  a  table 
lookup. 


In  the  other  saaolinq  procedure  to  be  considered*  each  aap 
vector  has  associated  with  it  a  set  of  ieage  vectors  of  siailar 
lenqths  and  each  set  is  saapled.  It  is  possible  for  an  iaaqe 
vector  to  watch  two  aap  vectors*  in  which  case  the  sets  are  not 
disjoint.  To  avoid  this  problea*  we  assuae  that  no  two  aap  edqes 
are  siailar  in  lenqth.  See  Section  4.5  for  a  discussion  of  the 
feasibility  of  such  a  selection. 

Our  basic  saapling  problea  way  be  stated  in  the  following 
ways  ye  are  given  d  sets  S^*.  •  .*Sd  of  iaaqe  vectors  which  fora 
a  partition  of  the  set  of  feasible  iaage  vectors.  Each  set  is 
guaranteed  to  contain  exactly  one  aap  vector.  Select  the 
saallest  nuabrr  of  saaples  r  such  that*  with  a  probability  of  at 
least  a  •  we  will  have  s  aap  edoes  in  the  saaple. 

The  selection  of  vectors  ffoa  each  set  is  now  analogous  to 
our  previous  saapling  procedure  except  that  the  nuaber  of  aap 
vectors  selected  can  only  be  zero  or  one.  One  aethgd  is  to 
select  the  s  saallest  sets  where  s  is  the  nuaber  of  aap  vectors 
we  wish  to  obtain  and  select  a  sufficiently  large  set  of  saaples 
so  that  the  probability  of  selecting  aap  vectors  froa  all  S* *% 
under  consideration  is  greater  than  a  •  As  far  as  a  single  set 
$1  is  concerned*  the  aap  contains  only  one  vector*  and  we  wish  to 
guarantee  with  qiven  probablity  that  the  vector  is  selected.  The 
orobablity  of  selecting  exactly  one  aap  vector  in  a  saaple  of 
size  r  is  given  by 
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Assuac*  without  loss  of  generality*  that  the  first  s  sets* 
Si,*  •  ..S8  are  the  smallest  sets.  Then  the  probability  of 
obtaininq  a  eap  edge  in  each  saeple  is  given  by 


a 

II  (r./n^) 
i-1 


we  wish 


to  select  the  r  's  so  as  to  ainialze 


s 

Z 

i-1 


ri 


subject  to  the  constraint  that 

a 

n  (r./n.i>a 
i-1  1 

We  are  not  aware  of  a  rapid  procedure  for  solving  this  problea* 
so  if  we  constrain  ourselves  further  to  saeple  the  saae  fraction 
of  each  set  so  r^/n^  is  the  saae  for  all  1*  and  this  auantity  is 
denoted  by  d*  we  have  d*  >ct  *  so  d  *a^s*  Tor  each  i*  define 
rt  to  be  the  saallest  integer  greater  than  or  eaual  to  njdp 

Other  saapling  scheaes  are  passible  to  handle  the  saapling 
froa  the  Si  's*  though  we  have  not  investigated  then.  The 
saapling  in  the  case  of  the  sets  clearly  reguires  a  very  large 
fraction  of  each  set  so  it  aay  be  best  to  just  use  all  of  the  s 
saallest  Si's.  If  the  nuaber  of  vectors  in  the  s  saallest  S^  's 


58 


is  very  larqe*  the  saapltnq  results  can  be  used  to  set  a 
threshold  above  which  registration  is  likely  to  fail*  so  tore 
sophisticated  procedures  such  as  feature  point  labellino  should 
be  attempted* 

One  limitation  on  the  above  saaplinq  procedures  is  based  on 
the  assumption  that  a  lower  bound  on  the  nuaber  of  aap  vectors 
found  in  the  iaage  should  be  deterained*  A  aore  useful  bound 
would  be  on  the  nuaber  of  aao  points  which  have  been  matched* 
ffatchinq  aao  points  rather  than  vectors  is  aore  aeaninqful  since 
the  vectors  are  generated  froa  the  points*  To  appreciate  the 
potential  difference  between  the  nuaber  of  aatched  vectors  and 
the  nuaber  of  aatched  points*  we  consider  two  extreae  cases*  If 
we  have  n  aao  vectors  which  have  to  be  aapped  and  all  have  an 
endpoint  in  coaaon*  then  only  n  -  1  points  have  been  aatched*  If 
no  two  vectors  have  an  endpoint  in  coaaon*  then  n  points  have 
been  aatched* 

Further  study  should  be  done  to  deteraine  the  relationship 
between  the  nuaber  of  aatched  vectors  and  aatched  points* 
Saaolinq  procedures  which  give  inforaation  on  the  distribution  of 
the  nuaber  of  aatched  points  can  be  defined*  but  extensive 
analysis  has  not  yet  been  completed*  A  simple  way  to  reduce  the 
problea  to  our  previous  sampling  problea  is  to  select  the  set  of 
aap  edqes  so  that  no  two  edges  have  a  point  in  coaaon* 

Soae  saaolinq  procedures  for  labelled  feature  points  can  be 
analyzed  easily*  If  all  points  are  labelled  as  in  Section  4*8* 
and  only  points  with  different  labels  are  joined*  then  we  aay 
view  the  vectors  as  being  divided  into  two  classes:  those  which 
are  also  aao  vectors*  and  those  which  are  not*  We  are  then 
interested  in  selecting  a  subset  which  is  sufficiently  large  to 
contan  a  speified  nuaber  of  aap  vectors*  The  analysis  is  now 
identical  to  that  in  our  first  randoa  vector  selection  procedure* 


4. a  Feature  Point  Labelling 

The  speed  and  accura cy  of  the  LNK  registration  procedure  is 
due  to  the  algorithm's  ability  to  eii«inate  consideration  of  many 
infeasible  transformations  in  contrast  to  ordinary  correlation 
procedures.  Rather  than  allowing  all  possible  transformations 
and  using  the  data  to  select  the  most  suitable  one  in  standard 
correlation  compu tat i ons *  the  set  of  transf ormat ions-  to  be 
considered  is  generated  by  the  data,  placing  more  structure  upon 
the  data  and  using  this  additional  structure  reduces  the  number 
of  transformations  considered  and  hence  increases  the  speed  and 
accuracy  of  the  procedure. 

A  simple  but  effective  enhancement  of  the  LNK  procedure 
results  if  each  feature  point  has  a  type  or  label  associated  with 
it.  This  type  may  be  defined  in  many  ways.  If  the  feature  point 
is  an  intersection  of  lines  such  as  roads*  the  number  of  lines 
entering  the  intersection*  the  angle  at  the  intersection*  and  the 
classification  of  pixels  in  a  neighborhood  of  the  intersection 
may  all  be  used  to  categorize  this  point. 

i 

The  labelling  of  feature  points  can  result  in  great  speedups 
in  the  matching  of  abstract  vectors*  but  this  must  be  balanced 
against  the  cost  and  accuracy  of  the  labelling.  At  one  extreme* 
enough  information  about  a  point  may  be  available  to  avoid  any 
confusion  of  this  point  with  any  other  point  in  the  image.  A 
matching-  of  two  such  points  with  the  corresponding  points  in  a 
second  image  or  map  is  adequate  to  determine  the  RT 
t ransf ormat ion*  thus  reducing  the  remainder  of  the  abstract 
vector  matching  to  . a  triviality. 

ue  do  not*  at  present-*  have  adequate  information  on  the 
cost-performance  tradeoffs  associated  with  feature  point  typing. 
Crude  approaches  like  the  noravec  interest  operator  or  spot 
detectors  are  unlikely  to  be  directly  useful  ior  accurate 
labelling  CLambird  193 q3  .  However*  they  can  be  extremely  fast 
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ana*  in  some  cases*  may  make  excellent  use  of  parallel 
architecture.  ("ore  elaborate  feature  point  detectors,  such  as 
the  LNK  intersection  procedures  (Section  3)  may  provide  far  more 
information  about  the  point*  but  the  cost  may  fee  considerable. 

The  selection  of  an  algorithm  for  typing  points  and  an 
appropriate  modification  of  the  LNK  proceoure  t.  use  this  typing 
will  require  an  assessment  of  several  factors*  First*  how 
rapialy  and  accurately  point  features  can  be  obtained.  Second, 
how  much  this  labelling  disambiguates  the  points.  *any  points 
may  have  similar  neighborhoods,  so  local  operations  may  yiel<j 
only  a  few  types  of  points.  Third,  what  improvements  will  result 
in  the  abstract  vector  approach  as  a  result  of  various 
laoellings.  For  example*,  if  an  image  contains  twenty  feature 
points,  what  advantage  results  from  being  able  to  divide  the 
twenty  points  into  four  types,  each  containing  five  points*  as 
opposed  to  two  groups  of  ten  points  each. 

while  all  three  of  these  facets  of  labelling  must  ultimately 
be  addressed,  we  could  not  address  the  second  issue*  the  extent 
of  oi sambi guat ion  possible,  until  the  feature  point  selection 
algorithms  (see  Section  3)  were  further  developed  and  tested. 
For  the  remainder  of  this  section,  we  assume  image  and  map 
feature  points  have  been  extracted  and  consider  the  effects  of 
varying  the  number  of  types  and  the  number  of  points  of  each 
type. 


The  LNK  reg  i  s  t-r  at  i  ve  procedure  involves  several  major  steps, 
each  of  which  is  affected  by  the  typing  of  points.  For  the 
analysis  in  this  section,  we  regard  the  basic  steps  as:  1) 
creation  of  abstract  vectors*  assuming  point  features  are  given; 
2)  selection  of  transf ormations  to  be  clustered;  3)  clustering  of 
t ransf ormat ions ;  4)  evaluation  of  each  clustered  transformation; 
and  5>  Selection  of  the  beat  transformation  and  an  assessment  of 
the  correctness  of  the  transformation. 


Step  5)  requires  explanation  since,  in  practice,  it  would 
seem  reasonable  to  select  the  transformat  ion  associated  with  the 
highest  match  weight  as  the  best  transformat  ion;  and  if  this 
transformation  leads  to  incons i s tenc i es  in  later  processing,  to 
try  to  perturb  it  slightly  to  get  the  next-best  match  weight  or 
other  acceptable  trans format  ion.  Although  this  procedure  is 
reasonable  if  all  point  features  are  obtained  before  abstract 
vectors  are  formed,  the  interleaving  of  registration  with  feature 
point  selection  described  in  Section  7  requires  an  evaluation  of 
a  registration  tr ansformat ion  based  upon  a  small  number  of 
feature  points  to  determine  if  more  feature  points  should  be 
sought  • 

The  first  step  of  tf»e  LNK  procedure,  the  selection  of 
abstract  vectors,  may  be  greatly  affected  by  the  typing  of 
points*  As  we  have  seen  in  Section  4*7,  several  obvious 
strategies  for  selecting  abstract  vectors  from  feature  points  may 
be  readily  defined.  Each  of  these  selection  strategies  leads  to 
several  new  selection  strategies  when  typed  points  are  available. 
Important  questions  in  the  practical  analysis  of  certain  of  these 
strategies  are  unresolved,  and  the  partial  analysis  of  all  would 
lead  to  a  profusion  of  results  without  providing  an  adequate 
basis  for  comparison*  Consequently,  the  present  discussion  will 
focus  on  the  simpler  pr„cedures  for  which  more  readily 
interpretable  results  are  available* 

The  first  case  we  consider  is  the  image-to-map  registration 
problem  in  which  the  number  of  map  points  is  sufficiently  small 
so  that  every  pair  of  map  points  defines  an  abstract  vector. 
Furthermore,  we  assume  the  number  of  image  points  is  sufficiently 
large  that  not  all  pairs  of  image  points  should  be  used  as 
abstract  vectors*  Let  n  denote  the  number  of  map  points,  m  the 
number  of  image  points,  and  k  the  number  of  image  edges  selected. 

The  above  situation  was  analyzed  in  Section  4*7,  where  the 
expected  number  of  image  edges  selected  which  corresponded  to  map 
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edges  was  computed.  The  comparison  of  this  result  with  the 
expected  numter  of  edges  for  typed  matches  depends  on  the  scheme 
for  using  the  typing.  Several  possibilities  will  be  considered 
anp  their  performances  compared* 


A  simple  abstract  vector  selection  scheme  is  to  connect  all 
pairs  of  points  of  the  same  type  by  an  abstract  vector  and  to 
connect  no  points  of  different  types.  If  the  m  image  points  are 
divided  into  p  groups,  , .  .  . Gp ,  where  group  contains  q* 
points,  then  the  total  number  of  image  edges  selected  is 


P 

m  E 

The  total  number  of  image  edges  is  2  where  m  *  I«1  •  As  a 

special  case,  assume  all  groups  are  the  same  size,  i.e.,  q ,  3  q2 
-  •  •  .=Qp •  Then  jER,  defined  to  be  the  ratio  of  the  number  of 
image  edges  using  typed  point  to  the  total  number  of  image  edges, 
is  given  by 


for  example,  if  m  *  30,  p  *  6»  we  have  IER  3  60/435,  resulting  in 
a  ten  percent  reduction  in  the  number  of  image  edges  to  be 
considered.  Given  that  p  groups  have  been  selected,  it  is 
natural  to  ask  how  IER  varies  as  the  distribution  of  points  in 
groups  varies. 

With  the  aid  of  a  simple  combinatorial  identity,  it  is  easy 
to  see  that,  for  fixed  p  and  m  where  p  divides  m,  IE#  is 
minimized  when  01  3  d2  *  •  •  *»dp»  The  identity  is  closely 
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related  to  the  tact  that  the  geometric  mean  0f  two  numbers  is 
less  than  or  equal  to  the  arithmetic  mean. 

Prop.  For  any  integers  a  and  b  with  a  >  b»  we  have  the  following 
i nequa li ty 


t1)  >  (0 + (0 


Proof:  a  >  b-1 

2a  >  2b”2 

a-3b-*>2  >  -a-b 

a  ♦a+b  -3b  +  2  >  a  -a*b  -b 

(a-M)a  +  (b-1 )  <b-2)  >  a  (a-1 )  ♦b(b-1 ) 


[a+1)  ! 

[a-lTfl 


+  (b-D; 

*  /  1-  O  \  t 


(b-3) 121 


(a- 2) 12! 


(b-2) 12! 


If  we  have  two  grouos  of  points  with  a  points  and  b  points 
respectively  with  a  >  b,  and  if  all  points  within  a  group  are 
joined  by  edges,  the  proposition  tells  us  that  the  total  number 
of  edges  increases  as  the  difference  between  a  and  b  increases. 
This  shows  that  if  we  have  only  two  types  of  points  in  the  image* 
the  maximum  reduction  in  tJie  number  of  eoges  occurs  when  the  two 
groups  are  of  equal  size*  The  general  case  of  p  groups  can  be 
handled  by  induction* 

The  selection  scheme  which  selects  edges  by  joining  points 
of  similar  types  has  been  shown  to  perform  best  when  all  groups 
are  of  equal  size  and  the  percent  improvement  edge  ratio  has  been 
computed*  Simplifying  the  expression  for  jER  *n  tb®  case  of  p 
groups  of  equal  size,  we  get 

IER  -  WP-I) 

B-1 
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For  large  m  and  p  much  smaller  than  mt  we  have  m/p-1  arn/p  and 
m/(m-1 )  asl ,  so  IER  &  1/p,  Hence  in  this  case  the  division  of 
points  into  p  groups  of  eaual  size  reduces  the  total  number  of 
edges  by  a  factor  of  p.  This  reduction  factor  is»  indirectly*  a 
measure  of  the  speedup  of  the  procedure  resulting  from  labelling 
points.  Since  the  speedup  deteriorates  as  groups  become  more 
unequal  in  size*  it  is  useful  to  find  a  procedure  which  performs 
better  in  the  case  of  great  variation  in  group  size.  To  make 
effective  use  of  this  pair  of  procedures*  a  measure  of  group  size 
equality  would  be  aesirable  to  help  select  the  appropriate 
procedure. 

The  joining  of  points  within  groups  results  in  an  edge 
selection  procedure  which  is  effective  when  the  groups  are  of 
eaual  size*  so  it  is  natural  to  consider  whether  joining  points 
between  groups  might  be  more  effective  when  the  groups  are  of 
unequal  size.  The  obvious  analog  is  to  join  two  points  by  an 
abstract  vector  if  the  points  do  not  lie  in  the  same  group. 
These  two  approaches  are  complementary  since  every  abstract  edge 
joining  two  points  of  the  image  join  either  two  points  in  the 
same  group  or  two  points  in  different  groups.  Thus  a  simple 
procedure  for  selecting  between  the  two  edge  selection  procedures 
is  to  compute  the  number  of  edges  in  the  two  cases  and  select  the 
procedure  which  uses  the  smaller  number  of  edges. 

If  we  fix  the  number  of  groups  at  p*  then  it  is  easy  to  show 
that  the  maximum  number*  d*  of  within  group  edges  occurs  when  all 
but  one  group  contains  exactly  one  point.  This  is  easily  seen  to 
be  the  grouping  which  yields  the  minimum  number  of  between  group 
edges.  The  worst  case  performance  tor  the  two  procedures  occurs 
when  the  number  of  intergroup  edges  is  equal  to  the  number  of 
intra-group  edges.  This  occurs  when  each  set  of  edges  is 
ore-half  the  total  number  of  edges*  i»e,,  the  number  of  edges  is 
m/2,  resulting  in  an  IER  of  1/2,  Thus  by  an  appropriate  choice 
of  edge  selection  routines*  the  number  of  edges  can  be  reduced  by 
a  factor  ranging  from  1/2  to  approximately  1/p  where  p  is  the 


Table  4*6  *  Number  of  matching  vectors  between  a  5-point  map  and 
an  image  containing  the  map  and  N  noise  points*  Each  map  point 
has  a  distinct  label  t  and  those  labels  are  randomly  distributed 
among  the  noise  points.  Two  edges  match  if  their  endpoint  types 
match  and  their  lengths  differ  by  less  than  ten* 


N 

#  of  Hatches 

N 

#  of  Hatches 

is 

17 

40 

41 

15 

12 

40 

45 

15 

13 

40 

44 

15 

19 

40 

33 

15 

16 

40 

54 

15 

17 

50 

54 

15 

15 

50 

79 

20 

20 

50 

78 

20 

21 

50 

63 

20 

23 

so 

60 

20 

20 

50 

69 

2C 

15 

30 

20 

30 

33 

30 

29 

30 

38 

30 

32 
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4*9  Point  Labelling  Experiment 


Labelled  feature  points  appear  promising  for  reducing  the 
number  of  RT  transf ormat ions  which  must  be  evaluated*  This 
section  describes  an  experiment  designed  to  estimate  the  average 
reduction*  Within  aost  of  the  range  of  parameters  investigated* 
the  reduction  in  the  number  of  transformat  ions  is  sufficient  to 
allow  all  remaining  transformations  to  be  evaluated  in  detail* 
thus  eliminating  the  need  for  clustering* 

The  experiment  consisted  of  two  phases*  The  first  phase 
consisted  of  an  eapirieal  estimation  of  the  average  number  of 
matches  between  image  and  nap  vectors*  Each  trial  consisted  of  a 
randomly  generated  mao  containing  five  points  and  randomly 
generated  images  containing  a  rotated*  translated  perturbed  of 
the  five  nap  points  in  addition  to  background  noise  points*  The 
horizontal  and  vertical  coordinates  of  all  points  were  generated 
according  to  a  uniform  dfstr ibut ion*  In  copying  the  map  points 
into  the  image*  the  x  and  y  coordinates  were  perturbed  by  adding 
vectors  generated  from  a  bivariate  gaussian  distribution  of  mean 
zero  and  variance  one*  Both  image  and  mao  were  512  x  512  pixels* 

A  count  of  the  number  of  matching  interpoint  distances 
between  image  and  map  was  computed*  A  pair  of  map  points  was 
declared  to  match  a  pair  of  image  points  it  the  vector  joining 
the  image  points  differed  in  lengths  by  no  more  than  ten  from  the 
vector  joining  the  map  points*  The  results  are  given  in  Table 
4*6* 


The  matching  of  the  nap  with  the  perturbed  version  of  the  ; 
nap  will  lead*  with  very  high  probability*  to  at  least  ten  ' 
matches*  Since  there  are  ten  nap  edges*  more  matches  may  arise j 
if  the  map  has  non-unique  edge  lengths*  The  percentage  of j 
correct  matches  (approximately  ten)  to  the  total  number  ofl 
matches  varies  from  about  20  percent  to  1*5  percent*  Based  on/ 
the  results  in  Table  4*6*  exhaustive  examination  of 
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transf ormations  determined  by  the  labelling  restriction  is 
feasible  for  naps  containing  as  nany  as  forty  noise  points*  it 
nay  actually  be  feasible  for  larger  nuebers  of  noise  points*  but 
■ore  work  oust  be  done  in  evaluating  the  cost  of  finding 
■atchweights  before  this  can  be  determined* 

This  phase  of  the  esperinent  provides  useful  information  in 
several  ways*  First*  it  serves  as  a  benchmark  against  which  we 
may  compare  the  reduction  in  matches  resulting  from  labelling* 
Second*  the  results  provide  a  measure  of  comparison  in  evaluating 
the  desirability  of  clustering  transformations  versus  eaamining 
all  transformat ions  corresponding  to  matches*  Since  the  tine 
required  for  matching  vectors  depends  in  nany  cases  only  upon  the 
number  of  matches  to  be  performed*  and  not  on  the  lengths  of  the 
vectors*  an  estimate  of  the  comparison  time  per  vector  can  be 
used  in  conjunction  with  Table  4*6  to  obtain  an  estimate  of  the 
tine  required  by  the  registration  algorithm*  A  third  use  of  the 
table  is  in  the  analysis  of  random  sanoling  of  matching  edges  to 
speed  up  the  registration  program  (see  Section  4*71* 


4.13  Triples  for  hatching 


The  first  phase  of  the  lnk  registration  procedure  determines 
those  transf ormat ions  for  which  at  least  one  image  vector  matches 
with  a  mao  vector.  This  is  equivalent  to  determining  those 
transformations  for  which  a  pair  of  m»P  points  matches  a  Pair  of 
image  points.  To  reduce  the  number  of  t ransformat ions  *e  may 
further  restrict  our  attention  to  those  t ransf ormat i ons  which 
result  in  at  least  three  points  matching  approximately.  This 
secion  describes  some  theoretical  considerations  in  evaluating 
the  relative  efficiency  of  matching  triples  versus  pairs  of 
points • 

The  average  number  of  matching  triples  between  a  " aP  and  an 
image  is  difficult  to  determine  ana lyt i ca l ly »  though  some  useful 
results  are  easily  described*  Assume  we  have  a  map  containing  m 
points  and  an  image  containing  n  points  including  the  m  map 
points.  There  are  (  "  J  *  m(m  -  1)/2  common  pairs  of  points  at 
the  correct  transformation  due  to  all  the  map  points  overlaying. 
There  are  <  3  }  *  m(m  -  1)<m  -  2>/$  common  triples  of  points  at 
the  correct  transformation.  Thus  the  number  of  triples  is  Cm 
2)/3  times  as  great  as  the  number  of  pairs  at  the  correct 
transformation.  Similarly,  for  any  transf ormation  for  which  K 
points  correspondi  the  number  of  matching  triples  is  (k  -  2>/3 
times  the  number  matching  pairs.  Thus  if  k±  transf ormat ions 
result  in  exactly  i  points  matching,  the  number  of  matching 
triples  is 


T  -  “  k±  i  < i  -  1)Ci  -  2>/6 
±-3 


while  the  number  of  matching  pairs  is 


1  i/Z 


p 


i(i  - 


We  now  consider  the  above  problem  in  the  soecial  case  where 
the  number  of  map  points  is  five*  we  ha *e 


T  a  Xj  ♦  4  k*  ■*  10 k5 

Pa  Kz  *  3ks  +  *  10ks 


The  selection  of  triples  rather  than  pairs  of  matching 
points  is  justified  when  P  -  T  is  sufficiently  large*  Note  that 
P  -  T  *  2k  ♦  2k  ♦  k  ,  so  the  number  of  pairs  is  at  least  as 
large  as  the  number  of  triples*  more  generally  we  have  the 
following : 

P  -  T  *  k,  ♦  "  k  Ui  -  1)  11/2)  Ci  ♦  Ci  -  2>/33. 

2  i-3 


Thus  the  number  of  pairs  is  never  less  than  the  number  of 
triples*  If  the  number  of  image  and  map  points  are  fixed  at  n 
ana  m,  respect ively*  then  the  expected  value  of  P  -  T  can  be 

comouted  from  the  expected  value  of  k  !  (k2»*  •  •  *kn).  In  the 

case  m  *  5»  we  need  only  compute  the  expected  value  of 

(k2  *  kj  fk%  )  * 

The  analytical  results  on  triples  matching  are  not  yet 

adequate  to  enable  us  to  draw  conclusions  on  the  efficiency  of 

the  use  of  triples  for  transformation  screening*  The 

experimental  results  on  triples  matching*  given  in  Section  4.11* 
indicate  that  triples  are  likely  to  be  highly  effective  in 
reducing  the  number  of  t ransf ormat  ion*  which  must  be  examined* 
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4*11  Matching  Triples  Experiment 

The  theoretical  analysis  of  the  effectiveness  of  Matching 
|  triples  of  points  to  determine  a  registration  transformation  has 

not  yet  reached  a  stage  where  the  expected  number  of  eatches  can 
|  be  found.  A  simulation  study  was  conducted  to  estimate  the 

number  of  matching  triples  for  fixed  nap  size  and  varying  image 
size* 

The  simulation  consisted  of  twenty  trials  divided  into  four 
groups  of  five  trials  each*  For  each  trial  a  nap  containing  five 
points  was  generated  using  a  uniform  random  number  generator  for 
the  x  and  y  coordinates  of  all  points*  A  perturbed  version  of 
each  point  was  then  copied  into  the  nap  where  the  perturbation 
consisted  of  adding  random  numbers  generated  by  a  mean  zero* 
variance  one  normal  process  to  the  x  and  y  coordinates*  Random 
noise  points  were  then  addjrd  to  the  image*  Xn  each  group  of  five 
trials*  the  number  of  image  and  nap  points  were  fixed*  Only  the 
locations  of  image  and  map  points  varied*  The  number  of  image 
points  over  an  trials  ranged  from  10  to  25  in  increments  of  five 
points* 

Two  types  of  matching  were  computed  for  each  trial*  the 
number  of  matching  pairs  and  the  number  of  matching  triples*  A 
matching  pair  (<a«b>t(ctd)>'  is  a  pair  of  map  points  a  and  b  and  a 
pair  of  image  points  d  and  e  such  that  the  distance  from  a  to  b 
differed  by  no  more  than  fiive  from  the  distance  from  d  to  e*  A 
matching  triple  is  a  set  of  three  map  points  a*  b*  and  c  and 
three  image  points  d*  e*  and  f  such  that  the  following  are 
matching  pairs: 

<<a«b)»(d*eJ) 

<<a,e),<d,f*> 

C(b*c) *<e*f *) 

Thus  the  triangles  formed  by  the  two  triples  are  approximately 
congruent* 

The  results  of  the  simulation  are  given  in  Table  for 

i  each  image  map  pair*  the  ratio  of  the  number  of  matching  pairs  to 
the  number  of  matching  triples  was  computed*  For  each  group  of 
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five  trials  In  which  the  miaber  of  Iaage  points  was  constant*  the 
five  ratios  were  arranged.  The  average  ratios  are  approxiaately 
a  linear  function  of  the  nuaber  of  points*  In  the  range  of 
paraaeter  values  studied*,  we  aay  state  that  the  following 
relationship  holds  approxiaately : 


R  *  .IN  ♦  1-.19, 


where  R  Is  the  ratio  of  the  nuaber  of  aatchlng  pairs  to  the 
nuaber  of  aatchlng  triples  for  N  n01se  points  in  the  1a«ge* 

The  above  results  have  several  applications*  F1r*tt  1n  the 
range  of  paraaeter  values  studied*  the  selection  of  aatchlng 
triples  reduces  the  nuaber  of  possible  transforaatlons 
sufficiently  to  perait  exhaustive  enuaeration*  On  an  iaage  with 
20  noise  points,  S  aap  points*  and  i00  iaage  edges  randoaly 
selected,  the  registration  with  the  clustering  prograa  would 
often  evaluate  20  to  30  transforaatlons  explicitly  while  the 
triples  screening  procedure  produced  an  average  of  32 
transforaat ions  to  exaalne  for  the  saae  type  of  iaage*  Thus  the 
triples  screening  provided  only  slightly  less  data  reduction  than 
the  clustering*  In  view  of  the  liaitations  on  clustering 
discussed  In  Section  5*  this  suggests  replaceaent  of  clustering 
by  triples  screening  if  this  can  be  done  rapidly  enough* 

A  second  useful  piece  of  inforaation  gained  froa  this  study 
is  the  approxiaate  nuaerical  relationship  between  the  nuaber  of 
noise  points  and  the  ratio  between  the  nuaber  of  aatchlng  triples 
and  aatchlng  pairs*  Zn  trying  to  design  a  fast  registration 
algoritha*  we  would  like  to  balance  the  cost  of  successful  but 
relatively  expensive  transf oraation  screening  such  as  triple 
aatchlng  versus  cheaper  bub  less  effective  screening  such  as  pair 
aatchlng*  If  we  can  estiaate  the  coaputational  cost  of  pair 
aatchlng  and  triple  aatchlng  as  a  function  of  the  nuaber  of  noise 
points*  the  approxiaate  eowetion  allows  us  to  assign  to  each 
nuaber  of  noise  points  the  expected  nuaber  of  transforaatlons  and 
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the  expecteo  cost  resulting  from  matching  pairs  and  triples* 
This*  in  conjuntion  with  an  estimate  of  the  cost  of  evaluating  a 
transformation*  would  allow  us  to  determine  the  preferable 
screenin9  method  and  estimate  the  average  running  time  of  the 
a  Igor i thm  • 


Table  4.7  Number  of  matching  pairs  (?)  of  points,  number  of 
matching  triples  (T)  of  points  and  ratio  of  P  to  T  (P/T)  for  a 
five  point  map  and  a  5 ♦N  point  image* 


5*  Clustering 


The  clustering  algorithm  in  the  LNK  registration  procedure, 
described  briefly  in  Section  2*  has  provided  reasonable  results 
la  aany  of  the  experiment*  described  in  this  report  and  in 
earlier  LNK  studies  CStoakaan  19813*  Despite  this  we  feel  that 
in  situations  in  which  abstract  vectors  are  used*  the  clustering 
can  and  should  be  avoided* 

There  are  several  reasons  for  avoiding  clustering  whenever 
possible*  First*  as  seen  in  Section  4*6*  the  current  clustering 
procedure  aay  fail  to  detect  a  peak  near  the  true  transforaation* 
This  situation  can  arise  in  several  ways*  To  avoid  losing 
clusters  due  to  clusters  being  spread  over  several  bias  without 
adequate  concentration  in  any  one  bin*  the  clustering  prograa 
uses  overlapping  bins*  Unfortunately*  aore  overlapping  bins  aust 
be  used  to  avoid  this  f ragaentat ion.  As  the  number  of  parameters 
to  be  estimated  grows*  as  when  we  add  scale  and  perspective 
changes*  the  nuaber  of  overlapping  bins  grows  rapidly*  thus 
slowing  down  the  clustering  considerably*  We  do  not  yet  have  any 
information  as  to  the  frequency  of  occurrence  of  this  phcnoaenon 
other  than  to  observe  that  It  has  occurred. 

The  f ragaentation  aay  be  avoided  to  some  degree  by  searching 
for  aore  peaks  in  the  cluster  space*  At  present*  only  three 
peaks  are  being  used*  More  peaks  aay  be  considered*  although 
this  can  increase  the  coaputational  cost  significantly  when  the 
clustering  proceeds  to  deeper  levels* 

A  second  reason  for  avoiding  the  clustering  is  that  the 
clustering  cannot  be  easihy  adapted  to  an  iterative  registration 
scheae  in  which  the  clustering  based  on  a  set  of  points  aust  be 
revised  as  new  points  are  added  (see  Section  6)*  It  is  possible 
that  Iterative  clustering  way  work*  but  we  think  that  aore  effort 
In  screening  out  infeasible  transformations  is  preferable  to  the 
use  of  clustering  for  picking  out  likely  transforaations* 

We  think  that  analysis  of  the  clustering  algoritha  cannot 
effectively  be  carried  out  until  inforaation  has  been  gathered 
about  the  effects  In  real  data  of  sampling  and  labelling  as 
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outlined  in  Section  4.  The  theoretical  analysis  of  ctusterinng 
procedures  has  produced  almost  no  useful  results  in  the 
clustering  literature,  so  if  clustering  is  necessary*  analysis 
will  be  primarily  empirical. 
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6*  Iterative  Registration 

The  high  cost  of  feature  point  extraction  suggests 
consideration  of  registration  procedures  which  perform  feature 
extraction  on  a  limited  portion  of  the  image  and  attempt 
registration  based  on  the  output.  If  the  results  are  not 
sati sfac tory ,  additional  portions  of  the  image  should  be  examined 
and  the  registration  should  be  attempted  once  again*  This 
process  would  continue  until  either  an  acceptable  registration  is 
achieved  or  the  entire  image  has  been  examined  for  point 
features • 

Three  basic  issues  must  be  resolved  before  this  iterative 
type  of  procedure  can  be  made  feasible*  First?  a  selection 
procedure  must  be  designed  to  determine  which  portions  of  the 
image  are  examined  on  successive  iterations*  Second*  the 
procedure  for  determining  which  registration  transf ormations  to 
examine  should  be  modified  to  allow  the  procedure  to  use  results 
from  previous  steps  of  the  iteration*  Third,  a  termination 
criterion  must  be  designed  for  acceptance  or  rejection  of  a 
registration  transformation  determined  after  an  iteration  of  the 
procedure* 

There  are  two  main  classes  of  procedures  for  selecting  which 
portion  of  the  image  should  be  examined  after  an  iteration* 
First.,  the  image  can  be  partitioned  into  square  windows*  An 
unexamined  window  could  be  selected  at  random  after  each 
iteration*  In  the  second  class  of  methods,  the  best  registration 
transformation  from  the  previous  iteration  can  be  used  as  an 
estimate  of  the  true  transformation*  If  an  image  is  being 
registered  to  a  mapf  this  estimate  can  then  be  used  to  determine 
where  other  map  feature  points  are  likely  to  appear  in  the  image* 
A  program  for  performing  this  second  method  is  currently  under 
development,  though  in  light  of  other  results  in  this  report,  we 
feel  the  method  of  random  window  selection  appears  more  fruitful, 
and  attention  should  be  turned  in  this  direction* 

The  main  reason  for  preferring  random  window  selection  is 
that  the  INK  procedure  tends  to  produce  t ransf ormat ions  which  are 
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either  fairly  accurate,  say  to  within  a  few  pixels  and  a  few 
degrees,  or  t rans f orma t ions  which  are  very  far  off  (see  Section 
4*6).  Thus  the  estimated  t ransf ormation  in  the  second  method  is 
likely  to  be  sufficiently  bad  as  to  be  worthless  or  sufficiently 
good  that  no  further  iterations  are  required.  For  fine  tuning 
the  registration  or  verification  of  the  registration 
transformation,  the  registration  transformation  nay  be  used  to 
seek  image  features  correspondi ng  to  map  features,  but  this  is  a 
one-step  process. 

The  modification  of  the  registration  procedure  to  Use 
information  from  previous  iterations  to  determine  which 
transformat  ions  to  examine  can  be  handled  in  two  ways.  First» 
some  sort  of  sequential  clustering  procedure  could  be  defined. 
Such  a  procedure  would  cluster  a  set  of  transformations  and  then 
update  the  clustering  when  a  new  set  c *  transformations  were 
added.  This  type  of  approach  appears  risky  since  updating 
procedures  would  tend  to  look  at  characterizations  of  the  points 
being  clustered  rather  than  the  points  themselves.  These 
characterizations  may  include  cluster  centers  and  radii.  Thus 
the  updated  clusters  may  be  much  less  reliable  than  a  direct 
clustering  on  all  points. 

A  second  procedure  for  using  information  fro*  previous 
transformations  is  available  if  the  number  of  feasible 
transformat ions  is  sufficiently  small  that  exhaustive  search  is 
possible.  Based  upon  the  results  of  Section  4.  we  think  that 
efforts  should  be  concentrated  in  this  direction.  As  new  feature 
points  are  added  on  successive  iterations,  the  updating  of  the 
list  of  transformations  is  a  simple  matter.  Procedures  for 
updating  the  match  weight  assigned  to  transformat ions  are  more 
tricky,  but  we  think  they  can  be  rapid. 

We  have  not  yet  investigated  termination  criteria  for 
iterative  registrat ion.  It  was  felt  that  more  experience  on  real 
data  using  the  inersection  detection  program  should  be  obtained 
before  reasonable  criteria  can  be  set. 
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Conclusions  and  Recommendations 


This  study  was  directed  toward  the  analysis  of  the  LNk 
registration  proceduret  both  to  understand  its  performance  in  its 
present  fora  and  to  investigate  aodif ications  which  nay  result  in 
iaproved  registration  accuracy  and  speed*  For  purposes  of 
analysis*  we  have  focused  on  theoretical  aethods  and  siaulation 
to  gain  knowledge  of  the  relative  merits  of  various  facets  of  the 
algoritha*  we  now  summacide  the  priaary  results  of  this  work* 

The  curve  intersection  detection  algoritha  of  Section  3*2 
appears  to  be  a  feasible  approach  to  the  extraction  of  feature 
points*  thus  enhancing  the  straight  line  intersection  detector 
currently  available*  This  procedure  should  now  be  tested  on 
aerial  iaagery  to  obtain  information  on  characteristics  of  the 
feature  points  obtained* 

The  LnK  procedure*  viewed  as  an  approai a*t i on  to  a 
cor  relation-based  method  for  registration  of  point  images*  has 
remarkably  high  accuracy  in  locating  the  correct  t ransformat ion. 
The  investigations  described  in  Sections  4*2*  4.3*  and  4.4 
provide  both  empirical  and  analytical  evidence  for  this 
conclusion* 

According  to  the  study  in  Section  4*6*  th«r«  are  two  main 
causes  of  registration  error  using  the  LNK  procedure*  First*  the 
clustering  routine  nay  faiU  to  detect  the  true  transformation  due 
to  the  cluster's  containing  this  transformation  being  split  over 
several  bins*  Second*  the  number  of  correctly  matching  vectors 
may  be  too  small  to  give  a  high  match  weight  to  the  correct 
transformation* 

The  second  problem  may  be  alleviated  by  more  elaborate 
screening  including  using  matching  triples  of  points*  labels  of 
points*  selection  of  map  vectors  based  on  good  length  properties* 
and  other  methods  described  in  Sections  4*7  -  4*11*  Each  of 
these  methods  can  significantly  reduce  the  number  of  vectors  to 
be  handled  by  the  current  registration  program*  thus  increasing 
the  expected  ratio  of  correctly  matching  vectors  to  all  input 
vectors*  In  the  range  of  parameter  values  considered*  several  of 
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these  procedures  provided  enough  reduction  in  the  nueber  of 
•etching  vectors  to  allow  elimination  of  the  clustering  prograe* 
thereby  elieinating  one  source  of  error* 

Combinations  of  serdening  Methods  such  as  using  labelled 
matching  triples  of  points  were  not  studied,  but  based  on  the 
present  work,  they  appear  hikely  to  improve  registration  accuracy 
still  further*  The  potential  of  such  combinations  is  great,  but 
the  computational  cost  of  examining  triples  must  be  examined 
carefully,  for  Its  worst-case  performance  can  deteriorate  rapidly 
as  the  number  of  map  points  increases* 

The  performance  of  the  registration  algorithm  is  dependent 
upon  the  way  in  which  the  map  feature  points  *re  selected  from 
the  map*  It  is  both  feasible  and  desirable  to  select  map  vectors 
so  no  two  are  similar  in  length*  further  conditions  on  map 
vector  selection  can  be  devised  to  Increase  the  likelihood  of  the 
registration  procedure  providing  an  accurate  registration 
transf ornat fan. 

Further  study  of  the  LNK  registration  procedure  should 
Include  both  analytical  and  experimental  research.  The  most 
pressing  need  is  the  acquisition  of  feature  point  images  derived 
from  aerial  Imagery  using  our  intersection  detection  routines* 
Naps  of  the  corresponding  areas  would  be  useful*  These  point 
images  would  allow  extensive  testing  of  the  Ideas  developed  In 
this  report* 

The  present  analysis  has  focussed  on  the  estimation  of 
rotation  and  translation  parameters  using  the  LNK  procedure*  The 
introduction  of  scale  and  perspective  distortions  leads  to 
greater  complexity  which  may  require  different  methods  of 
analysis*  In  particular,  clustering  eay  be  essential  in  this 
more  general  situation*  A  study  of  the  extent  to  which 
complexity  is  Increased  by  scale  and  perspective  distortions 
would  be  valuable* 

New  methods  for  reducing  the  number  of  matching  vectors  or 


81 


triangles  should  be  studied  in  the  case  where  scale  and 
perspective  distortions  arc  present*  A  study  should  be  cade  of 
properties  of  geoaetric  figures  which  are  invariant  under  these 
two  types  of  distortions.*  This  would  aid  us  in  designing 
procedures  for  recognizing  geoaetric  conf igurations  which  could 
not  aatch  under  any  scale  or  perspective  change*  Thfs  could 
reaove  auch  of  the  load  froa  the  clustering  routine* 

The  randoa  saapling  of  the  iaage  suggested  in  Section  6 
Should  now  be  explored  in  detail*  Criteria  for  accepting  a 
tentative  registration  Should  be  designed  and  tested*  Updating 
algorithas  for  deteraiiring  aatch  weights  and  feasible 
transf oraat ions  should  also  be  designed* 

Siaulation  experiaents  with  3-diaensional  aatching  should  be 
conducted*  Attention  should  be  given  to  the  potential  obscuring 
of  feature  points  due  so  viewing  angle*  The  creation  of 
artificial  terrain  aodels  would  aid  us  in  perforaing 
experiaent at  ion  along  these  lines* 
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